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(54) MOLECULE THAT HOMOLOGI2ES GENOTYPE AND PHENOTYPE AND UTILIZATION 
THEREOF 



(57) A molecule that homologizes a genotype and a 
phenotype and is prepared by covalentty joining the 3* 
end of a nucleic acid portion having a base sequence 
reflecting a genotype to the C-terminus of a protein por- 
tion containing a protein participating in phenotypie 
expression; and a method of constructing a molecule 
that homologizes a genotype and a phenotype, which 
comprises the steps of (a) preparing a DNA containing 
a gene free from termination codon, (b) transcribing the 
DNA into a RNA, (c) joining a spacer comprising a DNA- 
RN A chimera to the 3* end of the RN A t (d) further joining 
to the 3' end of the product of ligation a nucleoside or a 
substance having a chemical structure analogous to 
that of the nucleoside, the nucleoside and the sub- 
stance analogous thereto being capable of forming a 
covatent bond with an amino acid or a substance having 
a chemical structure analogous to that of the amino 
acid, and (e) synthesizing a protein in a cell-free protein 
synthesis system by using the product of ligation as a 
mRNA to thereby join the gene-containing nucleic acid 
portion to the product of gene translation. This molecule 
is extremely useful in evolutionary molecular engineer- 
ing for the modification of functional biopolymers, such 
as enzymes, antfoodies and ribozymes, and the crea- 
tion of biopolymers having functions which living organ- 
isms do not possess. 
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5 [0001] The present invention relates to a molecule assigning a genotype to a phenotype. More specifically, it relates 1 
to a molecule assigning a genotype to a phenotype, comprising a nucleic acid portion having a nucleotide sequence 
reflecting the genotype and a protein portion comprising a protein involved in exhibition of the phenotype. The molecule 
assigning the genotype to the phenotype of the present invention is a highly useful substance that can be utilized in evo- 
lutionary molecular engineering such as' in the modification of enzymes, antibodies, ribozymes and other such func- 

io tional biopolymers and creation of biopolymers having functions not found in living organisms. 

[0002] Through advances in biochemistry, molecular,biology and biophysics, It has been learned that living organisms 
are molecular machines which function and propagate by interactions among molecules. Among the characteristics, of 
earth's living organisms, the fundamentals are their preservation of Genetic information In DNA nucleotide sequences 
and their ability to translate this information into functional proteins through the medium of mRNA. Owing to progress 

is in genetic engineering, biopolymers with given sequences, like nucleotides and peptides, can now be easily synthe- 
sized. Protein engineering and RNA engineering, today a focus of attention, owe their existence to genetic engineering. 
The aims of protein engineering and RNA engineering are to solve the puzzle of the three-dimensional structures 
required for proteins and RNA fulfilling specific functions and to enable humans to freely design proteins and RNA pos- 
sessing desired functions. Because of the diversity and complexity of these structures and the difficulty of a theoretical 

20 approach to their three-dimensional structures, however, current protein engineering and RNA engineering are still at 
the stage of modifying some of residues at active sites and observing changes in the structure and functions. Human 
knowledge has thus not yet reached the stage of designing proteins and RNA. 

[0003] Understanding the functions of biopolymers in their relationship to the elemental processes of higher life phe- 
nomena will require elucidation of the correlation between protein molecular structure and function. The line of thought 

25 we take in the following is not only to make the best of "human knowledge" but also to take advantage of the "wisdom 
of nature." This is because we concluded that we would have to acquire the ability to put both to work in order to over- 
come the current difficulties of protein engineering and move forward with the design and production of functional 
biopolymers. When the classical methods are diverted to the design of "proteins with new functions and activities, the 
difficulty of protein design by site-specific mutations can sometimes be avoided. This can be called "taking advantage 

30 of the wisdom of nature." 

[0004] Although the drawback of this method is the difficulty of screening to identify mutants with new functions and 
activities, this difficulty is overcome by the RNA catalysts that have recently come into the spotlight. Attempts have been 
made to select an RNA with specific characteristics from among RNAs synthesized to have an extremely large number 
of random sequences (about 10 13 types) (Ellington, A. D. & Szostak, J. W. (1990) Nature, 346, 818-822). - • ■ 
35 [0005]' This is an examplebf evolutionary molecular* engineering. As typified by this example/the primary igoal in the 
evolutionary molecular engineering of proteins is to find out optimum sequences by searching an expansive sequence 
space of a scale unimaginable in conventional protein engineering. By "making the best of human knowledge" to devise 
a screening system tor this, it will be possible to discover numerous quasi-optimum sequences around the optimum 
sequences and thus to construct an experimental system for studying "sequence v6 function." 
40 [0006] The remarkable functions of living bodies were acquired through the process of evolution. Therefore, if evolu- 
tion can be replicated, it should be possble to modify enzymes, antibodies, ribozymes and other functional btopolymers 
and, further, to create biopolymers with functions not found In living organisms In the laboratory. Needless to say, 
research on protein modification and creation is an object of utmost importance to various aspects of biotechnology 
such as utilization of enzymes as industrial catalysts, biochips, biosensors and sugar-chain engineering. 
45 [0007] Given the fact thai molecular design utilizing structural theory is, as symbolized by the continuing high regard ■ 
for "screening." still in an unperfected state, the evolutionary technique has a practical value for utilization in selecting 
useful proteins as a more efficient strategy. Building a "time machine" capable of more efficiently producing evolution in 
a laboratory, if such were possible, would not only enable modification of enzymes, antibodies (vaccines, monoclonal 
antibodies etc.) and other existing proteins but also open the way to the creation of enzymes for decomposing environ- 
so mental contaminants, purifiers and others and new proteins not present in the biological world. Jf an experimental sys- 
tem for protein evolution can be e stablished, therefore, it can be expected to be aggressively utilizable for application in 
a wide range of fields including power saving and energy preservation in industrial processes, energy production and 
environmental preservation. The assigning molecule of the present invention is a highly useful substance in protein 
modification and other aspects of evolutionary molecular engineering. 



55 



Background Art 

[0008] Evolutionary molecular engineering is a field of study that attempts to conduct molecular design ol functional 
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polymers by utilizing hlg'h-cpeed Wecular evolution In the laboratory. I.e.; by laboratory irwestipatlon and optirrtzattoh 
of the adaptive locomotion of biopolymers in sequence space. It Is a completely new mdecular biotechnology that first 
produced substantial results in 1990 (Yuzuru Husimi (1991) Kagaku.'61 , 333-340; Yuzuru Husiml (199?) Kbza Shinka, 
Vol. 6, University of Tokyo Publishing Society). ' , 

[0009] Life is a product of molecular evolution and natural selection. The evolution of molecules Is a universal life phe- 
nomenon but its mechanism is not something that can be elucidated by studies that track the history of past evolution. 
Rather, the approach of constructing and studying the behavior of 6imple molecules and life systems that evolve in the 
laboratory better provides fundamental knowledge regarding molecular evolution and enables establishment of a veri- 
fiable theory applicable In molecular engineering. , 

[0010] It is known that a polymer system will evolve If it satisfies the following five conditions: (1) an open system far 
out of equilibrium, (2) a self-replicative system, (3) a mutation system, (4) a system vi^ genotype and* phenotype 
assignment strategy, and (5) a system with appropriate adaptation topography in sequence epaca (1) and (2) are con- 
ditions for occurrence of natural selection and (5) is determined beforehand by the physlcochemical properties of the 
biopolymer. The genotype and phenotype assignment of (4) is a prerequisite tor evolution by natural selectipn. 
[001 1] The following three strategies are adopted in both the natural world and evolutionary molecular engineering: 
(a) ribozyme-type in which the genotype and the phenotype are carried on the same molecule, (b) virus-type in which 
the genotype and the phenotype form a complex, and (c) a cell-type in which the genotype and the phenotype are con- 
tained in a single compartment (Figure 1 ). 

[001 2] As the ribozyme-type (a) in which the genotype and the phenotype are carried on the same molecule is a sim- 
ple system, success with RNA catalysts (ribozymes) has already been reported (Hiroshi Yanagawa (1993) New Age of 
RNA. pp.55-77, Yodosha). 

[0013] Conceivable problem points of the cell-type (c) are (1) the averaging effect, (2) the eccentricity effect and (3) 
the random replication effect. The averaging effect arises because the assignment of the genotype to the phenotype 
statistically averages out and becomes ambiguous when the number of copies of the cell genome is large. Since an 
evolved genome is only one among the number of copies in a cell (n), performance enhancement averages out and a 
struggle for existence in the cell population begins at selection coefficient (s)/n. A smaller copy number (n) is therefore 
advantageous for thecell-type. Due to the presence of the eccentricity effect, however, when the number of segments 
is large, n must be very large to prevent the eccentricity effect. The apparent selection coefficient in the struggle for 
existence in the cell population can therefore be expected to be very much smaller than in the case of the virus-type. 
Since the time required for selection is proportional to the reciprocal of the selection coefficient, the rate of evolution is 
much slower than that of the virus-type. Further, the random replication effect (3) is fatal to the cell-type. This is because 
the random replication of segmented essential genes by this effect makes replication of all essential genes prior to cell 
division extremely difficult. This means that even if an essential gene with an advantageous mutation should occur, the 
probability of its being replicated and passed on to a daughter celt is extremely low. 
[0014] Uniting of the genotype and the phenotype as in the virus-type (b) is necessary for efficient evolution. 
[001 5] Various techniques have already been proposed and are in the process of development for evolutionary molec- 
ular engineering of the virus-type (b) forming a complex of the genotype and the phenotype, including phage display 
(Smith Q. P. (1985) Science 228, 1315-1317; Scott, J. K. & Smith, Q. P. (1990) Science 249, 386-390), polysome Dis- 
play (Iviattheakis, L C. et al. (1994) Proc. Natl. Acad. Sci. USA 91 , 9022-9026). encoded combinatorial library (Brenner, 
S. & Lerner, R. A. (1992) Proc. Natl. Acad. Sci. USA 89. 5381-5383), and cellstat (Husirfii, Y et al. (1982) Rev. Sci. 
Instrum. 53, 517-522). 

[0016] Despite the importance of the magnitude of the searchable sequence space in evolutionary molecular engi- 
neering, however, a method for globally searching a sequence space comparable to that of the ribozyme type has not 
yet been established for the virus-type. 

[001 7] The reason for this is that viruses currently used in the method such as phage displays are parasites of existing 
cells and are therefore unavoidably subject to restraints imposed by the host cells, among which can be listed: (1) that 
only a limited sequence space can be searched owing to restriction by the cells, (2) membrane permeability, (3) bias 
due to host, and (4) limitation on library owing to host population. 

[0018] The polysome display method (Mattheakis, L C. & Dower, W. J. (1995) W095/1 1922) joins a nucleic acid and 
a protein via a ribosome by non-covalent bonding. It is therefore suitable when the chain length at the peptide position 
Js short but encounters handling problems.when the chain length is long as a protein. Since.the huge ribosome remains 
interposed, the conditions at the time of the selection operation (e.g.. adsorption, elution or the like) are subjected to 
severe restriction. The encoded combinatorial library (Janda, F. H. & Lerner, R. A. (1996) W096/22391) assigns a 
chemically synthesized peptide to a nucleic acid tag via beads. Since the yield of chemical synthesis of proteins with 
around 100 residues is extremely poor using currently available technologies, however, this technique can be used with 
short chain-length peptides but not with long chain-length proteins. 

[0019] One conceivable method of overcoming these problems is use of a cell-free translation system. A virus-type 
strategy molecule that simply binds the genotype and the phenotype in the cell-free systems has a number of advan- 
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tages Including the following; (1), that a huge mutant population approaching that of the ribozyme-type can be synthe- 
sized, (2) creation of various proteins without dependence on a host, (3)'no problem regarding membrane permeability, 
and (4) that the 2 1 st cod 6 can be used to introduce a non-native amino acid. , 

5 Description of the Invention 

[0020] An object of the present invention is to provide a molecule comprising a virus-type operation replicon which 
has the advantages of the aforementioned virus-type strategy molecule, exhibits a higher efficiency than phages, and 
1 suffers fewer limitations concerning environmental condition setting, namely, a molecule which should be called In vitro 

10 virus", wherein a nudeic acid and a protein are bound by a chemical bond, that is, a molecule In which a genotype is 
assigned to a phenotype! More specif ically, the present invention has been accomplished in order to provide a molecule 
exhibiting one-on-one relationship between Information and function, which can be utilized for creation of functional pro- 
teins and peptides, by performing genotype (nucleic acid) assignment to phenotype (protein) using a cell-free protein 
synthesis system, and binding the 3 -terminal end of a gene to the C-terminal end of a protein with a covalent bond on 

15 ribosome. Further, it is also an object of the present invention to obtain target functional proteins or peptides through 
investigation of vast sequence space, which is performed by repetition of selection of molecules that assign genotypes 
to phenotypes formed as described above (also referred to as In vitro virus' hereinafter) by the in vitro selection 
method, and amplification of gene portions of the selected in vitro viruses by the reverse transcription PCR, and further 
amplification while introducing mutations. , 

20 [0021] The present inventors earnestly conducted investigations to achieve the aforementioned objects, and as a 
result, they found that two kinds of molecules that assign a genotype to a phenotype, comprising a nucleic acid and a 
protein which were chemically bound can be constructed on a ribosome in a cell-free protein synthesis system. They 
further found that a protein evolution simulation system can be constructed wherein the assigning molecules (in vitro 
viruses) were selected by the in vitro selection method, gene portions of the selected in vitro viruses were amplified by 

25 reverse transcription PCR, and the genes were further amplified while introducing mutations. The present invention has 
been accomplished based on these findings. 

[0022] Thus the present invention provides a molecule assigning a genotype to a phenotype, which comprises a 
nucleic acid portion having a nucleotide sequence reflecting the genotype, and a protein portion comprising a protein 
involved in exhibition of the phenotype, the nucleic acid portion and the protein portion being directly bound by a chem- 
30 ical bond. 

[0023] According to preferred embodiments of the present invention, there are provided the aforementioned assigning 
molecule wherein a 3 -terminal end of the nucleic acid portion and a C-terminal end of the protein portion are bound by 
a covalent bond, and the aforementioned assigning molecule wherein a 3 -terminal end of the nucleic acid portion cov- 
alently bound to a C-terminal end of the protein portion is puromycin. - - — 

35 [0024] According to another preferred embodiment of the present invention, there is also provided the aforementioned 
assigning molecule wherein the nucleic acid portion comprises a gene encoding a protein, and the protein portion is a 
translation product of the gene of the nucleic acid portion. The nucleic acid portion preferably comprises a gene com- 
posed of RNA, and a suppressor tRNA bonded to the gene through a spacer. The suppressor tRNA preferably com- 
prises an arrticodon corresponding to a termination codon of the gene. Alternatively, the nucleic acid portion may 

40 comprise a gene composed of RNA, and a spacer portion composed of DNA and RNA, or DNA and polyethylene glycol. 
The nucleic acid portion may comprise a gene composed of DNA, and a spacer portion composed 0} DNA and RNA. 
[0025] As further aspects of the present invention, there are provided a method tor constructing a molecule assigning . 
a genotype to a phenotype, which comprises (a) boding a DNA comprising a sequence corresponding to a suppressor 
tRNA, to a 3 '-terminal end of a DNA containing a gene through a spacer, (b) transcribing the obtained DNA bonded 

45 product into RNA, (c) bonding, to a 3-terminal end of the obtained RNA, a nucleoside or a substance having a chemical 
structure analogous to that of a nucleoside, which can be covalent ly bonded to an amino acid or a substance having a 
chemical structure analogous to that of an amino acid, and (d) performing protein synthesis in a cell-free protein syn- 
thesis system using the obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a 
translation product of the gene; and a method for constructing a molecule assigning a genotype to a phenotype, which 

so comprises (a) preparing a DNA containing a gene which has no termination codon, (b) transcribing the prepared DNA 
into RNA, (c) bonding a chimeric spacer composed of DNA and RNA to a 3Merminal end of the obtained RNA, (d) 
bonding, to a ^-terminal end of the obtained bonded product, a nucleoside or a substance having a chemical structure 
analogous to that of a nucleoside, which can be covalently bonded to an amino acid or a substance having a chemical 
structure analogous to that of an amino acid, and (e) performing protein synthesis in a cell-free protein synthesis sys- 

55 tern using the obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a translation 
product of the gene. 

[0026] According to a preferred embodiment of the present invention, there is provided the aforementioned construc- 
tion method wherein the nucleoside or the substance having the chemical structure analogous to that of the nucleoside 
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[0027] As another aspect of the present invention, there is provided a method for constructing a molecule assigning 
a genotype to a phenotype. which comprises (a) preparing a DNA containing a gene which has no termination bodon, \ 
(b) transcribing theprepared DNA Into RNA, (c) bonding a chimeric spacer composed of DNA and polyethylene glycol ( 
to a 3'-terminal end of the obtained RNA, (d) bonding, to a 3*-terminal end of the obtained bonded product, a nucleoside 
or a substance having a chemical structure analogous to that of a nucleoside, which can be covalently bound to an 
amino acid or a substance having a chemical structure analogous to that of an amino add, and (e) performing protein 
synthesis in a cell-free protein synthesis system using the obtained bonded product as mRNA to bond a nucleic acid 
portion containing the gene to a translation product of the gene. 

[0028] As another aspect of the present invention, there is provided a method for constructing a molecule assigning 
a genotype to a phenotype, .which (Cornprises (a) preparing a DNA containing a gene which has no termination codon, 
(b) transcribing the prepared DNiA Into' RNA, (c) bonding a spacer corpposed of double*tranded DNA to a 3Merminal 
end of the obtained RNA. (d) bonding, to a ^-terminal end of the obtained bonded product, a nucleoside or a substance 
having a chemical structure analogous to that of a nucleoside, which can be covalently bound to an amino add or a sub- 
stance having a chemical structure analogous to that of an amino acid, and (e) performing protein synthesis in a cell- 
free protein synthesis system using the obtained bonded product as mRNA to bond a nucleic acid portion containing 
the gene to a translation product of the gene. 

[0029] As a further aspect of the present invention, there is provided a method for constructing a molecule assigning 
a genotype to a phenotype, which comprises (a) preparing a DNA containing a gene which has no a termination codon, 
and a nucleotide sequence of a spacer, (b) transcribing the prepared DNA into RNA, (c) bonding, to a 3'-terminal end 
of the obtained RNA, a nucleoside or a substance having a chemical structure analogous to that of a nucleoside, which 
can be covalently bonded to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, (d) adding a short chain PNA or DNA to a 3Merminal end side portion of the gene in the obtained RNA bonded 
product to form a double-stranded chain, and (e) performing protein synthesis in a cell-free protein synthesis system 
using the obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a translation product 
of the gene. 

[0030] As a still further aspect of the present invention, there is provided a method for protein evolution simulation, 
which comprises a construction step for constructing assigning molecules from a DNA containing a gene by any one of 
the construction methods mentioned above, a selection step for selecting the assigning molecules obtained in the con- 
struction step, a mutation introduction step for introdudng a mutation into a gene portion of an assigning molecule 
selected in the selection step, and an amplification step for amplifying the gene portion obtained in the mutation intro- 
duction step. In the method for evolution simulation, the construction step, the selection step, the mutation introduction 
step and the amplification step are preferably performed repeatedly by providing the DNA obtained in the amplification 
step to the construction step. Further, there is provided an apparatus for performing the aforementioned method for evo- 
lution simulation, which comprises a means for constructing assigning molecules, said means comprising a first bond- 
ing means for bonding a DNA comprising a sequence corresponding to a suppressor tRNA to a 3*-terminal end of a 
DNA containing a gene through a spacer, a transcription means for transcribing the DNA bonded product obtained by 
the first bonding means into RNA, a second bonding means for bonding, to a 3Merminal end of the RNA obtained by a 
transcription means, a nucleoside or a substance having a chemical structure analogous to that of a nucleoside, which 
can be covalently bound to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, and a third bonding means for performing protein synthesis in a cell-free protein synthesis system using the 
bonded product obtained by the second bonding means as mRNA to bond a nucleic acid portion containing the gene 
to a translation product of the gene, or a means for constructing assigning molecules, said means corrprising a tran- 
scription means for transcribing a DNA containing a gene into RNA, a first bonding means for bonding a chimeric 
spacer composed of DNA and RNA, a chimeric spacer composed of DNA and polyethylene glycol, a double-stranded 
spacer composed of DNA and DNA, or a double-stranded spacer composed of RNA and a short chain peptide nucleic 
acid (PNA) or DNA to a ^-terminal end of the RNA obtained by the transcription means, a second bonding means for 
bonding, to a 3'-termina1 end of the RNA-spacer bonded obtained by the first bonding means, a nucleoside or a sub- 
stance having a chemical structure analogous to that of a nucleoside, which can be covalently bound to an amino add 
or a substance having a chemical structure analogous to that of an amino acid, and a third bonding means for perform- 
ing protein synthesis in a cell-free protein synthesis system using the bonded product obtained by the second bonding 
means as mRNA to bond a nucleic acid portion containing the gene to a translation product of the gene: a selection 
means tor selecting the constructed assigning molecules; a mutation introduction means tor introdudng a mutation into 
a gene portion of an assigning molecule selected; and an amplification means for amplifying the gene portion to which 
the mutation is introduced. 

[0031] As a still further aspect of the present invention, there is provided a method for assaying proteirVprotein or pro- 
tein/nucleic acid inter molecular action, which comprises a construction step for constructing assigning molecules by 
any one of the aforementioned construction methods, and an assay step for examining intermolecular action of the 
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Brief Description of the prawings / . . ' / ; ' . 

5 [0032] 

Figure 1 shows strategies for genotype (nucleic acid portion) assignment to phenotype (protein portion). 

Figure 2 shows a method for construction of the molecule assigning the genotype to the phenotype of the present 

invention wherein a nucleic acid portion and a protein portion are bonded in a site-directed manner. 

to Figure 3 6hows chemically-modified portions of the ^-terminal ends of nucleic acid portions, which are appoint of 
the construction of the molecule assigning the genotype to the phenotype (in vitro virus)., 
■ Figure 4 6hows a method for construction of the molecule assigning the genotype to the phenotype of the present 
invention wherein a nucleic acid portion and a protein portion are bonded In a non-slte-directed manner. 
Figure 5 is a photograph of electrophoresis Image that shows spacer optimization in the site-directed method. It 

i5 shows the results of 4% polyacrylamide gel electrophoresis (in the presence of 8 M urea) of a DNA obtained 
through a process comprising translation of each RNA genome having a spacer in a length corresponding to each 
of the prepared fractions a, b, and c in the presence of a biotinyiated lysyl tRNA in an E. coli cell-free translation 
system, specific absorption on streptavidin-coated magnetic beads, reverse transcription, and amplification by 
PCR (staining was silver staining). Lane 1 is for the spacer length of fraction a (255-306 residues). Lane 2 is tor the 

20 spacer length of fraction 2 (102-238 residues), and Lane 3 is for the spacer length of fraction c (0-85 residues). 

Figure 6 is a photograph of electrophoresis image showing bonding of a nucleic acid portion and a protein portion 
in a 6ite-directed method. The results were obtained by t8% polyacrylamide gel electrophoresis (in the presence 
of 8 M urea and SDS): Lane 1 tor a translation product of mRNA encoding the 4 repeats region of a tau protein, 
which was obtained in an E.coli ceil-free translation system while labeled with f^SJ-methionine, and Lane 2 for a 

25 translation product of the mRNA whose 3Merminal end was bonded to sup tRNA having puromycin. and whose 5 • 
terminal end was labeled with [ 32 P], which was obtained in an E. coli cell-free translation system. 
Figure 7 is a photograph of electrophoresis image showing bonding of nucleic acid portion and protein portion in 
Jhe non-site-directed method. The results were obtained by 18% polyacrylamide gel electrophoresis (in the pres- 
ence of SDS): Lane 1 for a translation product of mRNA encoding the 4 repeats region of a tau protein, which was 

30 obtained in an £ coli cell-free translation system, while labeled with [ 35 S]-methionine, and Lane 2 for a translation 
product of the mRNA whose 3*-terminal end was bonded through a spacer to puromycin labeled with [ 32 P] at the 
5' end. which was obtained in an E. coli cell-free translation system, and Lane 3 for the translation product of Lane 
2 digested with ribonuciease T2. 

Figure 8 shows an example of the method for constructing the molecule assigning the genotype to the phenotype 

35 (in vitro virus) according to the present invention. 

Figure 9 is a photograph of electrophoresis image showing bonding of rCpPur to the C-terminal of the N-terminal 
half (1 -1 65) of human tau protein. Three kinds of genomes, i.e., one having a stop codon but not a DNA spacer (the 
first lane from the left), one having neither of a stop codon and a DNA spacer (the second lane from the left), and 
one not having a stop codon but having a DNA spacer (the third lane from the left) each at the S'-terminal end of 

40 mRNA encoding the N-terminal half (1-165) of human tau protein, were constructed, and translated in a cell-free 
translation system utilizing rabbit reticulocyte lysate In the presence of rCpPur labeled with 32 P at 30°C tor 20 min- 
utes. The translation products were analyzed by 1 1.25% SDS-PAGE. The lane at the right end shows the result for 
a product obtained by translation of the mRNA encoding the N-terminal half of human tau protein (1-165) in the 
presence of [ 35 SJ-methionine under the same condition mentioned above. 

45 Figure 10 is a photograph of electrophoresis image showing generation of in vitro viruses in a cell-free translation 
system. (A) shows a time course of generation of in vitro viruses. A genome composed of the mRNA encoding the 
N-terminal half of human tau protein (1 -1 65), a DNA spacer (1 05 mer), a peptide acceptor, and rCpPur was trans- 
lated in a cell-free translation system utilizing rabbit reticulocyte lysate and containing [^SJ-methionine, and the 
translation product was examined in a time course (at 5 minutes, 10 minutes, 20 minutes, and 40 minutes) at 30°C. 

so The translation products were analyzed by 11.25% SDS-PAGE. The first lane from the left shows the result 
obtained by using the RNA encoding the N-terminal half of human tau protein (1-165) as mRNA. and examining 
incorporation of [ 35 S]-methionine into the protein under the same condition as mentioned above. The lane at the 
right end shows the result of in vitro virus genome labeled with 32 P. (B) shows influence of concentration of in vitro 
virus genome for the generation of in vitro viruses. Lane 1 shows the results for a genome labeled with [^PJ-rCpPur 

55 at the 3'-terminal end. Lane 2 for a genome (1 .2 ug) to which rCpPur was attached at its 3'-terminal end. Lane 3 for 
a genome (0.33 ug) to which rCpPur was attached at its 3'-terminal end. and Lane 4 tor a genome (0.64 ug) to 
which rCpPur was attached at its 3*-terminal end. As for Lanes 2-4, the genomes were translated in a cell-free 
translation system utilizing rabbit reticulocyte lysate and containing [ 35 S]-methionine at 30°C for 20 minutes. The 
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translation products were analyzed by 11 "25% SDS-PAQE, *' * , ^ ! ' 

Figure 11 Is a photograph of electrophoresis image showing generation of In vitro viruses In a cell-free translation 
system. An In vitro' virus genome composed of the mRNA encoding the N-termlnal half (1-165) of human tau pro- 
tein, a DNA spacer (105 mer), a peptide acceptor, and [ 32 P]-rCpPur was translated by utilizing rabbit reticulocyte 
lysate at 30°C for 20 minutes. The translation products were analyzed by 11.25% SDS-PAGE. The bonding of the' 
genome and the protein could be confirmed by digestion with mung bean nuclease. When the translation product 
(Lane 3) was digested with mung bean nuclease, bands appeared (Lane 4) at the locations corresponding to mon- 
omer and dimer (Lane 1 ) of the N-terminal half of human tau protein (1 -1 65). Lane 2 shows the result for an in vitro 
virus genome labeled with ^R 

Figure 12 shows process steps of a protein evolution simulation method utilizing In vitro viruses. 

• . "V'V ' '. ; "V""' ■ ' v *. ' : " ' • ' 

Best Mode for Carrying oirt the invention , ' ■ 

[0033] In this specification, some technical terms are used, and those technical terms have the following meanings 
when herein used. The term "nucleic acid portion" means a bonded product of a nucleoside or a substance having a 
chemical structure analogous to a nucleoside, for example, RNA, DNA, PNA (peptide nucleic acid; polymers comprising 
nudeic acids linked via amino acid analogues) and the like, and "protein portion" means a bonded product of an amino 
acid or a substance having a chemical structure analogous to an amino acid such as naturally-occurring amino acids 
and non-naturally-occurring amino acids. The term "suppressor tRNA (sup tRNA)" means a tRNA which can suppress 
mutation by structural change, for example, reading a termination codon on mRNA as a codon corresponding to a cer- 
tain amino acid. The expression of "having a nucleotide sequence reflecting genotype" means to contain a gene or a 
part thereof relating to a genotype, The expression of "containing a protein involved in exhibition of phenotype" means 
to contain, for example, a protein whose expression itself is a characteristic of phenotype, a protein involved in exhibi- 
tion of a characteristic of phenotype by its function as an enzyme or the like. 

[0034] The spacer located at the 3'-terminal end side of the nucleic acid portion may be any spacer provided that ft is 
a polymer substance preferably having a length of not less than 1 00 A, more preferably about 1 00 to 1 000 A. Specifically, 
single-stranded chains of RNA or DNA, double-stranded chains of DNA and DNA, double-stranded chains of RNA and 
short chain PNA or DNA (e.g.. about 15 to 25 nucleotides), and polymer materials such as polysaccharides, which are 
naturally-occurring or synthetic, synthetic organic polymer substances such as polyethylene glycols, preferably polyeth- 
ylene glycols having a molecular weight of about 3,000 to 30,000 and the like can be mentioned. 
[0035] The nucleic acid portion and the protein portion of the assigning molecule of the present invention are linked 
through a chemical bond such as a covalent bond. In particular, preferred are those formed by bonding a nucleoside or 
a substance having a chemical structure analogous to a nucleoside, or a linked product thereof present at the 3*-termi- 
"nal end of the~nucleic~acid "portion to" an aminoacid or a substance having a chemical structure analogous to an amino 
acid present at the C-terminal end of the protein portion via a chemical bond, for example, a covalent bond. 
[0036] For the bonding between the nucleic acid portion and the protein portion, for example, puromycin, 3 -N-ami- 
noacylpuromycin aminonucleoside (PANS-amino acid), which have an amide bond as the chemical bond at the 3'-ter- 
minal end of the nucleic acid portion, e,g., PANS-Gly wherein the amino acid portion is glycine, PANS- Vial wherein the 
amino acid portion is valine, PANS-Ala wherein the amino acid portion is alanine, and further PANS-(any of the other 
amino acids) wherein the amino acid portion is an of the other amino acids, can be utilized. 3 -W-Aminoacyl-adenosine 
aminonucleoside (AANS-amino acid), which comprises as the chemical bond an amide bond formed by dehydration 
condensation of the amino group of 3'-aminoadenosine and the carboxyl group of an amino add, for example, AANS- 
Gly wherein the amino acid portion is glycine, AANS-Val wherein the amino acid portion is valine, AANS-Ala wherein 
the amino acid portion is alanine, and further AANS-(any of the other amino acids) wherein the amino acid portion is 
any of the other amino acids, can also be utilized. Those composed of a nucleoside or a nucleoside bound to an amino 
acid via an ester bond may also be used. Further, any other materials having a binding mode capable of binding a nucl- 
eoside or a substance having a chemical structure analogous to a nucleoside and an amino acid or a substance having 
a chemical structure analogous to an amino acid can also be utilized. 

[0037] The molecule assigning the genotype to the phenotype of the present invention can be constructed by, for 
example, (1) a method where the binding of the nucleic acid portion and the protein portion is formed in a site-directed 
manner, or (2) a method where the bonding of the nucleic acid portion and the protein portion is formed in a non-site- 
directed manner, which will be explained hereinafter. 

[0038] First, (1) the method where the bonding of the nucleic acid portion and the protein portion is formed in a site- 
directed manner will be explained. 

[0039] In this method, a molecule assigning a genotype to a phenotype can be constructed by (a) bonding a DNA 
comprising a sequence corresponding to sup tRNA, to the 3'-terminal end of a DNA containing a gene through a 
spacer, (b) transcribing the obtained DNA bonded product into RNA, (c) bonding, to the 3*-terminal end of the obtained 
RNA, a nucleoside or a substance having a chemical structure analogous to that of a nucleoside, which can be cova- 
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lently bound to an arrind acid or a substance having a chemical structure^analoooustothat of an amino add, e.g., puro- 
mydn,(d) peffonping protein synthesis in a cell-free protein synthesis system, e.g M ,an E coll celf-frde protein synthesis 
system; using the obtained bonded product as mRN A, and thus (e) affording a molecule assigning a genotype to a phe- 
notype comprising a gene RNA (genotype) and a protein (phenotype) which Is a translation product of the gene, which 
5 are chemically bound through a nudeoside or a substance having a chemical structure analogous to that of a nucleo- 
side, e.g., puromydn. 

[0040] That is, according to this method of the present invention, when a termination codon comes into the A site of 
ribosome during the protein synthesis, a sup tRNA is correspondingly incorporated, and a nudeoside or a substance 
having a chemical structure analogous to that of a nucleoside, e.g., puromycin, present at the S'-termlnal end Of the sup 
tRNA is bound to a protein by the action of peptidyl transferase (Figure 2). Therefore, this method is site-directed as for 
the formation of the bonding between the nucleic acid portion and the protein, which depends on the genetic code. 
[0041] ft has been known that puromycin (Figure 3) inhibits the protein synthesis in bacteria (Nathans, D, (1964) Proc. 
Natl. Acad. Sd. USA, 51 . 585-592; Takeda. Y. et al. (1960) J. Biochem. 48, 169-1 77) and animal ceils (Ferguson, J. J. 
(1962) Biochim. Biophys. Acta 57, 616-617; Nemeth, A.M. &dela Haba, G. L (1962) J. Biol. Chem. 237, 1190-1193). 
Puromycin, whose structure resembles the structure of aminoacyl tRNA, reacts with peptidyl tRNA bound to the P site 
of ribosome, and it is released from ribosome as peptidyl puromycin, and thus interrupts the protein synthesis (Harris, 
R. J. (1971) Biochim. Biophys. Acta 240, 244-262). 

[0042] It is not practical to purify native sup tRNA and bond it to mRNA, because of the problems concerning the puri- 
fication of sup tRNA and the easily hydrolyzable ester bond at the 3*-terminal end of tRNA. Through investigations of 
tRNA identity, it has been elucidated that unmodified tRNA may be aminoacytated like intact tRNA, and that the ami- 
noacylated unmodified tRNA may be taken into rfoosome, and translated (Shimixu, M. et al. (1992) J, Mol. Evol. 35, 
436-443). The identity of tRNA is also utilized in order to prepare sup tRNA. 

[0043] it has been reported that the aminoacyl synthetases d alanine, histidine, and leudne do not recognize the anti- 
codons thereof (Tamura. K. et al. (1991) J. Mol. Recog. 4, 129-132). Therefore, it can be expected that, by repladng the 
anticodon of tRNA for alanine with a termination codon (ag., amber), tRNA for alanine (sup tRNA) would be incorpo- 
rated corresponding to the termination codon. 

[0044] In this respect, it comes int o question whether t RNA whose 5 -terminal-end side is not made up by RNAse P 
or the like, unlike ordinary tRNA, may enter into the A site of ribosome or not. This is the most important problem to be 
investigated in determining feasibility of the model of the present invention: It has been known that the 3Merminal ends 
of Brome Mosaic Virus (BMV) and Turnip Yellow Mosaic Virus (TYMV) have a tRNA-like structure, and they are ami- 
noacytated by aminoacyl synthetase, and incorporated at an efficiency of 1% in a cell-free translation system (Chen, J. 
M. & Hall. T. C. (1973) Biochemistry 12, 4570-4574). Supposing that RNA of BMV is incorporated even by 1% by ribos- 
ome, it can be expected that RNA having intact tRNA at its 3'-terminal end may be incorporated more efficiently. Even 
if it is incor porated at an ef ficiency of 10% or less ot that of inta ct tRNA, th ere is a reasonable possibility that it can win 
the competition with the release factor by the concentration effect. 

[0045] Therefore, before the experiment for bonding a protein to the 3'-terminal end of mRNA-sup tRNA (mRNA 
ligated at its 3'-terminal end with sup tRNA through a spacer), it was examined whether even sup tRNA separated from 
mRNA entered into the A site of ribosome and was bound to a protein. A sup tRNA whose 3'-terminal end was bonded 
to puromydn was actually prepared, and added to a cell-free protein synthesis system to examine whether the sup 
tRNA portion entered into the A site of ribosome corresponding to occurrence of a termination codon and bound to a 
protein. The 4 repeats region of tau protein (127 residues) was used as mRNA (Goedert, M. (1989) EMBO J. 8, 392- 
399). As a result, when the translation was performed in a cell-free protein synthesis system, It could be confined that 
the sup tRNA having puromydn at its 3-terminal end was incorporated into the A site of ribosome corresponding to a 
termination codon and bound to a protein (Figure 2). 

[0046] Then, RNA-sup tRNA bonded products having different lengths of the spacer between mRNA and sup tRNA 
were constructed, and it was attempted to select an optimum length of the spacer which afforded the best efficiency of 
the incorporation of the sup tRNA portion into the A site of ribosome by the in vitro selection method. As a result, it was 
found that the RNA-sup tRNA bonded product having a certain spacer length was chemically bound to a protein that 
was a translation product thereof with a good efficiency. 

[0047] In order to construct the molecule assigning the genotype to the phenotype of the present invention, a nude- 
osjde or a substance having a ch emical structure anal ogous t o that of a nucleoside, w hich is to be .bond^Jto the 3'- 
terminal endof the nucleic acid portion, and can be covalently bound to an amino acid or a substance having a chemical 
structure analogous to that of an amino add, e.g., 2 '-deoxycytidylylpuromydn (dCpPur) and ribocytidyipuromycin (rCp- 
Pur) (Figure 3), must be synthesized first. 

[0048] An exemplary method for synthesizing dCpPur is as follows. First, puromycin-5 '-monophosphate can be pre- 
pared by chemically phosphorylating the 5 '-hydroxy! group of puromycin using phosphorus oxychloride and trimethyl- 
phosphate. Then, the amino group of the amino acid portion and the 2'-hydroxyl group of the ribose portion in 
puromydn-5 '-monophosphate can be protected by reacting puromycin-5 '-monophosphate with triftuoroacetic acid and 
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, trHluoroacetic anhydride. The protected product can be reacted with Bz-DMT deoxycytidine In which the amino group 
of the pyrimidine ring and the S'-hydroxyl group of the ribose portion In deoxycytidine are protected, In the presence of 
a condensation agent, dcyclohexylcarbodilmide, and then deprotected with acetic acid and ammonia to afford 2 '-deox- 
ycytkJylylpuromycin (dCpPur). pdCpPur can be obtained by phosphorating the 5 -hydroxyl group of dCpPur with poly- 
5 nucleotide kinase 

[0049] The ribocytkJytpuromycin can be prepared by condensing puromycin and rC-p-amidite having protective 
groups in the presence of tetrazole. and oxidizing and deprotecting the product. 

[0050] Then, the construction of a bonded product constituting the nucleic acid portion for binding the nucleic acid 
portion and the protein portion in a site-directed manner will be described hereinafter. 

10 [0051] As the bonded product constituting the nucleic acid portion used for the site-directed method, tor example, a 
bonded product comprising 5'-(T7 promoter region)-(Shine-Dalgarno (SD) sequence region)-(mRNA regionHspacer 
regton)-(sup tRNA region)-(puromycin region)-3 ' connected in this order in sequence can be mentioned. 
[0052] In the construction of this bonded product for the nucleic acid portion, a plasmkj comprising the 4 repeats 
region, which is a microtuble-binding region of human tau protein called htau24 (Ooedert, M. (1989) EMBQ J. 8, 392- 

15 399), inserted downstream of T7 promoter (pAR3040) is constructed first, and it is digested with restriction enzymes 
BglW and BamH\ to afford a linear DNA. This DNA is used as a template, and amplification is carried out by PCR by 
using primers for upstream region containing T7 region (forward) and for downstream region containing the SD region 
and a region around the initiation codon (backward), and Taq DNA polymerase. 

[0053] In the above method, three methionines may be added to the backward primer in order to enhance detection 
so sensitivity for radioactive methionine in the protein portion after the protein synthesis. That is, leucine at position 4, and 
lysines at positions 5 and 8 of the 4 repeats region are replaced with methionines. Eventually, the translated 4 repeats 
protein contains four methionines in total. Then, amplification is carried out by PCR using a DNA containing the afore- 
mentioned linearized 4 repeats region as a template, a complementary chain of the backward primer mentioned above 
as a forward primer, and a backward primer which is designed so that the C-terminal end of the 4 repeats region should 
25 have an amber codon as the termination codon. 

[0054] The two kinds of DNA fragments amplified by the PCR. namely, the DNA fragment containing the 17 promoter 
and the SD region and the DNA fragment containing the 4 repeats region are mixed, initially extended without primers, 
and then amplified by PCR again by using a primer containing the sequence of T7 promoter as the forward primer, and 
a primer containing a termination codon at the C-terminal of the 4 repeats region as the backward primer. 
30 [0055] This DNA bonded product (T7 promoter-SD-4 repeats) is ligated to a double-stranded DNA fragment having 
cohesive ends at the both ends and composed of 1 7 residues in tandem by using DNA ligase to afford ligation products 
having different spacer lengths. 

[0056] After the ligation, the product was fractionated into three fractions (a, b, c) based on the length by polyacryla- 
mide gel electrophoresis (PAGE). The spacer is represented as (17)n wherein n - 15 to 18 for the fraction a, n - 6 to 

35 14 for the fraction b, and n - 0 to 5 for the fraction c. As the sup tRNA, a native alanyl tRNA whose several sites and 
anticodon are modified into amber (UAQ) is prepared by chemical synthesis. This sup tRNA is ligated to the ligation 
products of the fractions a, b and c having different spacer lengths by using T4 DNA ligase. For the ligation site, an 
excessive amount of a single-stranded backing DNA is used, and after once melted by temperature elevation, the 
strands are annealed, and a complementary strand is formed, and ligated. After the ligation, the ligation product is 

40 amplified by PCR using primers for the 5' end and 3 -terminal end of the ligation product. This DNA ligation product is 
transcribed by using T7 RNA polymerase to form an RNA ligation product 

[0057] By ligating the pdCpPur chemically synthesized in the above to the 3 '-terminal end of this RNA ligation product 
using T4 RNA ligase, there can be obtained an RNA ligation product, 5 '-(T7 promoter region)-(SD region)-(4 repeats 
region)-(spacer region)-(sup tRNA region) -(puromycin)-3 ', which can be used as a gene in a cell-free protein synthesis 
45 system. 

[0058] The protein synthesis is performed by adding the above RNA ligation product as mRNA to a cell-free protein 
synthesis system such as cell-free protein synthesis extracts of £ coti or rabbit reticulocytes. In order to obtain an opti- 
mum spacer length for obtaining the most efficient bonding of the nucleic acid portion (RNA) and the protein portion, 
the following experiment is performed. 

so [0059] That is, the protein synthesis is performed in a cell-free protein synthesis system by using aforementioned RNA 
ligation products having three kinds of different spacer lengths, corresponding to the fractions a. b and c. as the gene. 
In this synthesis, by adding tRNA charged with modified lysine comprising biotin bound through the c-amino group of 
the lysine, bioiinyltysine is incorporated in several positions of lysine residues in the translated 4 repeats protein. After 
the protein synthesis, magnetic beads coated with streptavidin on their surfaces are added to isolate the protein incor- 

55 porating the biotin. 

[0060] If the nucleic acid portion (RNA) has bonded to the protein portion through puromycin, the nucleic acid portion 
(RNA) should be bound to the C-terminal end of the protein. When reverse transcription was performed by using a 
sequence corresponding to the N-terminal region of the 4 repeats as a forward primer and the 3'-terminal end portion 
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'of su^ gel electroprweste to "confirm whether- the RNA- 

' prtfW-Bondec^ up^by the mkgnetlc beads, a band o> reverse transcribed DNA'was 

observed only tor the spacer length of the fraction c. This means that the RNA ligation product having the spacer length 
of the i fraction cterrk^ efficiently bound to the protein portion. 1 
[0061] Now, it will be explained about (2) the method where the bonding of the nucleic acid portion and the protein 
portion is famed in a non-site-directed manner. 

[0062] In this method, a molecule assigning a genotype to a phenotype can be constructed by (a) preparing a DNA 
containing a gene which has no termination codon, (b) transcribing the prepared DNA into RNA, (c) bonding a chimeric 
spacer composed of DNA and RNA to the 3'-terminal end of the obtained RNA, (d) bonding, to the 3Merminal end of 
the bonded preset, a nucleoside or a substance having a chemical structure analogous to that of a nucleoside, which 
can be cbvalently bound to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, e.g., puromycin!' (e) performing protein synthesis in a cell-free protein synthesis system using the obtained bonded 
product as mRNA, and thus (f) attorning a molecule assigning a genotype' to a phenotype comprising a gene RNA and 
a protein which Is a translation product of the gene, which are chemically bound through puromycin or the like. 
[0063] That is, according to this method of the present invention, a nucleoside or a substance having a chemical struc- 
ture analogous to that of a nucleoside, e.g., puromycin, present at the 3'-terminal end of the nucleic acid portion does 
not enter into the A site of ribosome corresponding to the termination codon of mRNA on ribosome, but randomly enters 
depending on the spacer length, and puromycin or the like at the 3 -terminal end of the RNA-DNA chimera nucleic acid 
portion is chemically bound to a protein by the action of peptidyl transferase (Figure 4). Therefore, this method is non- 
site-directed as for the formation of the bonding between the nucleic acid portion and the protein, which does not 
depend on the genetic code. 

[0064] In this method, the molecule assigning the genotype to the phenotype can be constructed by using a non-she- 
directed ligation product for the nucleic acid portion in the same manner as in the aforementioned site-directed method 
of(1). 

[0065] As the ligation product constituting the nucleic acid portion used for the non-site-directed method, for example, 
a ligation product composed of 5'-(T7 promoter region)-(Shine-Daloarno (SD) sequence region)-(mRNA region)- 
(spacer region)-(puromycin region) -3 ' connected in this order in sequence can be mentioned. 
[0066] In the construction of this ligation product for the nucleic acid portion, the construction from the T7 promoter 
region to the end of the 4 repeats region may be similar to that explained for the construction of the ligation product lor 
the nucleic acid portion used in (1) the site directed method mentioned above, provided that a primer designed not to 
have a termination codon by replacing the two termination codons at the C-terminal end of the 4 repeats, ochre (CTG) 
and amber (TAA), with CAG (glutamine) and AAA (lysine), respectively, is used as a backward primer for the PCR ampli- 
fication of the ligation product constructed above used as a template. 

[0067] This DNA ligation product is transcribed as a template by using T7 RNA polymerase to afford a corresponding 
RNA ligation product. This single-stranded RNA ligation product is separately ligated to each of single-stranded chem- 
ically-synthesized DNA tinkers (chain length; 20, 40, 60, and 80 nucleotides) by using T4 RNA ligase. Then, each liga- 
tion product is ligated to a single-stranded DNA-RNA chimeric oligonucleotide comprising 25 residues (DNA; 21 
residues, RNA; 4 residues), which is designated as peptide acceptor, by using T4 DNA ligase in the presence of a sin- 
gle-stranded backing DNA. 

[0068] Because the sequence of the peptide acceptor contains the 3'-terminal end sequence of alanyl tRNA, and it 
enhances the Incorporation of a puromycin derivative into the A site of ribosome, It Is preferable to use the peptide 
acceptor between the spacer region and the puromycin region. 

[0069] By ligating the pdCpPur chemically synthesized in the above to the 3Merminal end of the above ligation prod- 
uct using T4 RNA ligase, there can be obtained an RNA-DNA chimeric ligation product 5 -(T7 promoter region (RNA))- 
(SD region (RNA))-(4 repeats region (RNAJ)-(spacer region (DNA))-(peptide acceptor region)-(puromycin)-3'. which 
can be used as a gene for a cell-free protein synthesis system. 

[0070] If protein synthesis is performed by using the RNA-DNA chimeric ligation product mentioned above as a gene 
in a cell-free protein synthesis system, there can be obtained a bonded product comprising a nucleic acid portion (RNA- 
DNA chimeric ligation product, genotype) and a protein portion (phenotype), which are connected by a chemical bond 
through puromycin. 

[0071] In the aforementioned method, a chimeric spacer of DNA and polyethylene glycol can also be used instead of 
the chimeric spacer of DNA and RNA. 

[0072] In the above method, a spacer composed of a double-stranded chain of DNA and DNA, or a double-stranded 
chain of RNA and short chain PNA or DNA (e.g., about 1 5 to 25 nucleotides) may also be used instead of the chimeric 
spacer of DNA and RNA. The spacer composed of the double strand of DNA and DNA is not necessary to be double- 
stranded in its full length, and it may be double-stranded for most part (usually, several residues at the both ends are 
single-stranded, and remaining portion is double-stranded). The double-stranded spacer composed of RNA and short 
chain PNA or DNA can also be prepared by (a) preparing a DNA containing a gene which has no termination codon, 
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, and a nucleotide sequence bra spacer, (b) transcribing the'prepared DNA Into RNA, (c) bondloQ;tothe SMermlnal end 
of the obtained RNA, a nucleoside a a substance having a chemical structure analogous to that of a nucleoside, which 
can be covalently bound to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, and (d) adding a short chain PNA or DNA to a 3'-terminal-end side portion of the gene In the obtained RNA 

5 bonded product to form a double-stranded chain. 

[0073] The genetic engineering techniques mentioned in the present specification such as isolation and preparation 
of nucleic acids, ligation of nucleic acids, synthesis of nucleic acids, PCR, construction of plasmids, and translation in 
cell-free system can be performed by the methods described in Sambrook et al. (1989) Molecular Cloning, 2nd Edition, 
Cold Spring Harbor Laboratory Press, or similar methods unless otherwise indicated. 

10 [0074] The assigning molecule of the present invention can also be obtained by successively bonding each of the 
elements by any known chemical bonding methods In addition to the methods exemplified above.' tt* • "*m v "*n < 
[0075] The protein evolution simulation method of the present invention is a method comprising steps of (1) construc- 
tion of In vitro virus genomes, (2) completion of in vitro viruses, (3) selection process, (4) introduction of mutation, and 
(5) amplification, as shown in Figure 12. These steps or repetition of these steps as required allows modification and 

i5 creation of functional proteins. Among these steps, the steps of (1 ) and (2) can be performed by the construction meth- 
ods explained above in detail. That is, the step (1) corresponds to construction of the bonded product comprising the 
nucleoside or the substance having the chemical structure analogous to that of the nucleoside, and the step (2) corre- 
sponds to the construction of the assigning molecule from the bonded product The steps of (3), (4) and (5) will be 
described hereinafter. 

20 [0076] The selection process of (3) means a process of evaluating function (biological activity) of protein portionscon- 
6tituting the in vitro viruses, and selecting in vitro viruses based on a desired biological activity. Such a process has 
been known, and described in, for exanrple, Scott. J. K. & Smith, G. P. (1990) Science, 249. 386-390; Devlin, R E. et al. 
(1990) Science, 249. 404-406; Mattheakis. L. C. et al. (1994) Proc. Natl Acad. Sci. USA, 91, 9022-9026 and the tike. 
[0077] Then, mutations are introduced into the nucleic acid portions of the selected in vitro viruses, and the in vitro 

2$ viruses are amplified by PGR or the like in the steps of (4) introduction of mutation, and (5) amplification. When the 
nucleic acid portion of in vitro viruses is composed of RNA, mutation can be introduced after a cDNA is synthesized by 
reverse transcriptase. The amplification of the nucleic acid portion may also be performed while introducing mutation. 
' The introduction of mutation can be readily performed by already established error-prone PCR (Leung. D. W.. et al.. 
(1989) J. Methods Cell Mol. Biol.. 1. 11-15). Sexual PCR (Stemmer, W. P. C. (1994) Proc. Natl. Acad. Sci. USA 91. 

30 10747-10751) or the like. 

(1) In vitro virus genomes can be constructed by using nucleic acid portions for in vitro viruses which have been 
introduced with mutation and amplified, (2) in vitro viruses can be completed by using the in vitro virus genomes, 
(3) in vitro viruses can be selected based on a desired biological activity, and (4) mutation introduction and ampli- 
3$ fication can be carried out. By repeating these steps as required, modification and creation of functional proteins 
can be realized. 

[0078] The means contained in the apparatus of the present invention for performing the aforementioned protein evo- 
lution simulation method themselves are known ones, and operations in these means 6uch as addition of reagents, stir- 
ao ring, tenpeijature control, and evaluation of biological activity can be performed according to the methods known per 
se. By combining these operations, an automatic or semi-automatic apparatus of the present invention can be con- 
structed. 

[0079] The step of constructing assigning molecules in the method for assaying pr oteinfcrotein or protein/nucleic acid 
intermolecular action of the present invention generally comprises steps of (1) synthesizing mRNA from a gene library 

45 or a cDNA library, and constructing an in vitro genome, and (2) constructing an in vitro virus comprising mRNA and a 
corresponding protein, which are bonded on ribosome, by utilizing a cell-free protein synthesis system. 
[0080] The step (1 ) corresponds to synthesis of mRNA using RNA polymerase from cDNA of DNA of which sequence 
has been known and which contains a sequence corresponding to ORF. cDNA of DNA of which sequence is unknown 
and which contains a fragment resulting from fragmentation with a suitable restriction enzyme or the like, and construc- 

so tion of an in vitro virus genome by utilizing the mRNA. 

[0081] The steps of above (1 ) in vitro virus genome construction, and (2) in vitro virus construction can be performed 
by the construction methods explained above in detail. 

[0082] The assay step for examining intermolecular action between assigning molecules and other proteins or nucleic 
acids (DNA or RNA) usually comprises steps of (3) selecting only proteins having a particular function from the in vitro 
55 viruses constructed in the step (2). and (4) subjecting selected in vitro viruses to reverse transcription, amplification, 
and sequencing. 

[0083] In the step (3). target proteins or nucleic acids (DNA or RNA) and other substances, for example, saccharides 
and lipids, are bound to a microplate, beads or the like beforehand through covalent bonds or non-covalent bonds, and 
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.the In vitro viruses constructed in the step (2) are added thereto, to react under a certain temperature condition foe a 
certain period of time, and it Is washed to remove in vitro viruses which has not been bound to the target. Then, the in 
vitro viruses which have been bound to the target are released. This step can be performed by the already-established 
EUSA (Enzyme Linked Immunosorbent Assay, Crowther, J. R. (1995) Methods in Molecular Biology, Vol. 42, Humana 
Press Inc.) or a similar technique. 

[0084] In the step (4), the in vitro viruses released in the step (3) are reverse-transcribed and amplified by reverse 
transcription PCR, and the amplified DNA was sequenced directly or after cloning. 

[0085] According to the assay method, of the present invention, it becomes possible to identify a function of a gene 
product (protein) corresponding to a gene whose function is unknown by (1) synthesizing mRNA from gene DNA whose 
sequence is known or unknown to construct in vitro virus genomes, (2) constructing in vitro viruses by using the in vitro 
virus genomes, (3) selecting only those binding to a target protein or nucleic acid or other substances, for example, sac- 
charides and lipids from among the in vitro viruses, and (4) subjecting selected in vitro viruses to reverse transcription, 
amplification, cloning and sequencing. 

[0086] To perform the method for assaying intermolecular action mentioned above, an apparatus can be constructed 
by combining known appropriate means. The means contained in the apparatus may be per se known ones, and oper- 
ations in these means such as addition of reagents, stirring, temperature control, and evaluation of biological activity 
can be performed according to the methods known per se. By combining these operations, an automatic or semi-auto-, 
matic apparatus for assaying intermolecular action can be constructed. 

20 Examples 



10 
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[0087] The present invention will be more specifically explained with reference to the following examples. However, 
the following examples should be construed to be an aid for obtaining more specific understanding of the present inven- 
tion, and the scope of the present invention is not no way limited by these examples. 

Example 1 : Preparation of in vitro virus (1) 



< 1 ) Preparation of 3'-ferminal-end portion of nucleic acid portion 
30 (a) Synthesis of phosphorylated puromycin (pPur) Materials: 



[0088] Puromycin (3'-[a-amino-p-methoxyhydrocinnamamido]^ was purchased 

from Sigma. Phosphorus oxychloride, and trimethyl phosphate were purchased from Wako Pure Chemicals. 



35 Methods: 

[0089] A solution formed by mixing phosphorus oxychloride (1 .5 mmol) and trimethyl phosphate (1 1 .4 mmol) was ice- 
cooled, and puromycin (0.3 mmol) was added thereto to mix sufficiently, and the mixture was allowed to react at 0°C for 
7 hours (Yoshikawa, M. et al. (1969) Bull. Chem. Soc. Jap. 42, 3505-3508). The reaction mixture was then added to an 
40 ice-cooled mixture of acetone (40 ml) and ether (20 ml) containing sodium perchlorate (NaCI04, 0.4 g), and stirred suf- 
ficiently. Then, water' (720 ml) was added to the mixture, and the mixture was stirred at 4°C for 24 hours to hydrolyze 
the chlorine group. The product precpitated after the hydrolysis was separated by centrrfugation, and washed with ace- 
tone and ether. The resulting white powder was dried in vacuo to afford phosphorylated puromycin with a yield of 70 to 
90% based on the puromycin. 

45 

(b) Protection of phosphorylated puromycin by acetylation Materials: 



[0090] Trifluoroacetic acid (TFA) was purchased from Nacalai Tesque. Trrfluoroacetic anhydride (TFAA) was pur- 
chased from Wako Pure Chemicals. 

so 

Methods: 



[0091] The dried phosphorylated puromycin (0.2 mmol) and TFA (5 ml) were mixed, and TFAA (2 ml) was added to 
the mixture at -10°C. followed by stirring. The mixture was allowed to react at room temperature for 1 hour with stirring 
55 (Weygand. F. & Gieger, R. (1956) Chem. Ber. 89. 647-652). The reaction was quenched by adding water (50 ml), and 
the TFA was removed by repeating a procedure comprising addition of water (10 ml) and evaporation to dryness under 
reduced pressure 5 times. Finally, water (50 ml) was added to the resulting product, followed by lyophilization to afford 
phosphorylated puromycin in which the amino group of the amino acid portion in puromycin and the 2'-hydroxyl group 
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of the ribose portion were protected with acetyl groups with a yield of 60 to 60% based on the phpsphorylated puromy 



(C) Synthesis of dCpPur (2'<leoxycytldyl(3'->5')puromycln) Materials:* 

5 [0092] BZ-OMT deoxycytidine (A/^benzoyl-S '-0-(4,4 '-dimethoxytrityl)-2'<ieoxycytidine) was purchased from Sigma, 
and DCC (dicyclohexyl carbodiimide) was purchased from Watanabe Chemical. The pyridine was purchased from 
Nacalai Tesque. 

10 Methods: 

[0093] The phosphoryiated puromycin protected with acetyl groups (40 jimol) and Bz-DMT deoxycytidine (600 nmol) 
were dehydrated by repeating a procedure comprising addition of pyridine (2 ml) and evaporation to dryness three 
times, and finally pyridine (2 ml) was added thereto. DCC (400 jimol) was added to the mixture with stirring, and the 

is mixture was allowed to react at room temperature tor 3 days to 2 weeks (Ralph, R. K. et at. (1965) J. Am. Chem. Soc. 
87, 5661-5670; and Harris, R. J. et al. (1972) Can. J. Biochem. 50, 918-926). After the reaction, the DMT group was 
removed by a reaction with 80% acetic acid (5 ml) for two hours. Then, the acetyl group was removed by a reaction with 
concentrated aqueous ammonia/ethanol (6 ml, volume ratio: 2:1) at20°C for 2 days. The concentrated aqueous ammo- 
nia was removed by evaporation under reduced pressure, and the residue was dissolved in water (40 ml). The resulting 

20 solution was applied on a column packed with QAE-Sephadex A-25 (Pharmacia) and adsorbed thereon. Fractions con- 
taining the target product were eluted with 0.5 M triethylamine carbonate (TEAB, pH 7.5), lyophilized. and f inally sepa- 
rated by HPLC to afford deprotected dCpPur with a yield of 1 to 5% based on the puromycin. 

< 2 > Preparation of nucleic acid portion (in vitro virus genome) 

25 ... 
[0094] Two kinds of in vitro virus genomes, i.e., (1) one for bonding a nucleic acid portion and a protein portion in a 
she-directed manner, and (2) one for bonding a nucieic acid portion and a protein portion in a non-site-directed manner, 
' were prepared. 

30 Materials: 

[0095] An E. coli cell-free protein synthesis system (£ coll S30 Extract System for Linear Templates) was purchased 
from Promega. T7 RNA polymerase, T4 DNA ligase, T4 DNA kinase, human placenta ribonuclease inhibitor, EcoRI, 
8amHI and deoxyribonucleotides were purchased from Takara Shuzo. Restriction enzymes BstH\ and 80/11 were pur- 

35 chased" from New England Labs. As for [ 35 S]-methionine, and [ r 32 P]- ATP, those from Amersham. and as for Taq DNA 
polymerase, those from Kurabo and Grainer were used. As for the other biochemical reagents, those from Sigma and 
Wako Pure Chemicals were used. A plasmid containing the microtuble-btnding region of human tau protein (4 repeats) 
(pAR3040) was prepared by picking up the full length gene of human tau protein from a cDNA library of human brain 
cloned in X ZAPII by PCR. introducing the gene into a plasmid, amplifying only the 4 repeats region in the plasmid, and 

40 introducing the amplified product into a plasmid. As the PCR (polymerase chain reaction) apparatuses. Model PTC-100 
(MJ Research) and Model ASTEC PC800 (Astec) were used. 

(1) Preparation of genome for site-directed bonding 

45 A. Preparation of DNA for mutated 4 repeats portion 

[0096] 

1) A plasmid (pAR3040) comprising microtuble region (4 repeats) of human tau protein (Goedert, M. (1989) EMBO 
so J. 8, 392-399) was constructed, and linearized by digestion with restriction enzymes BpVII and BamHl 

2) The 4 repeats portion containing the T7 promoter region and the Shine-Dalgarno sequence was amplified by 
PCR from the genome prepared above. For this amplification, as primers. Left+ (SEQ ID NO: 1) was used for 5' 
side, and Right-(SEQ ID NO: 2) for the 3' side. Right- had such a sequence that the leucine before the ochre ter- 
mination codon should be mutated into amber termination codon. The PCR conditions were 92°C/30 seconds for 

55 denaturation, 65°C/30 seconds for annealing, and 73°C/1 minute for elongation, and this cycle was repeated 30 

times. . »• « 

3) Then, the amplified genome was purified, and mutated by utilizing PCR in order to promote the incorporation oi 
methionine and hence enhance detection of radioactive isotope. That is. primers Left- (SEQ ID NO: 3), and Right+ 
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, (SEQ ID NO: 4) containing a region desired to be mutated were synthesized. First, using the DNA of the above 2) 
as a template! It was, Amplified by PCR with primers Left* and .Left-, and the amplified DNA was designated as 
"Left". Ampltficatibn by PCR was also performed with primers Right*, and Right-, and the amplified DNA was des- 
ignated as "Right" After 5% polyacrylamide denatured gel electrophoresis, "Left" and "Right" were excised form 
5 the gel, and extracted. The excised Left and Right were first amplified by PCR under the same conditions as men- 
tioned above without primers. Further, 1 ul taken from this, reaction mixture was used as a template, and it was 
amplified by PCR under the same conditions with primers Left+ and Right-. From the above procedure, DNA of the 
mutated 4 repeats portion in which the number of methionine was increased from one to four was prepared. 

to 6. Ligation of alanine suppressor tRNA (Ala-sup tRNA) containing spacers having different lengths to 4 repeats portion 
[0097] . , . . ■ < 

1) The BamH\ site located in the 3'-termlnal-end sequence of the above 4 repeats portion obtained in the above A 
is was digested with BamM . Then, to remove 3'-terminal-end fragment at the BamHI site, only the 4 repeats portion 

in 5 '-terminal-end sequence was extracted and purified by using QIAquick PCR Purification Kit (QIAGEN). 

2) The purified product of the above 1), and Spacer-A (SEQ ID NO: 5) which was phosphoryiated at 5 -terminal end, 
by T4 kinase were ligated by using T4 DNA tigase while they were backed with Spacer-B (SEQ ID NO: 6). 

3) Spacer-C (SEQ ID NO: 7) which was phosphoryiated by T4 kinase, and Spacer-B which had a region comple- 
te mentary to Spacer-C were ligated by using T4 DNA ligase. The reaction was performed at 15°C for 2 hours. Then, 

the product was purified by ethanol precipitation. 

4) The products of the above sections 2) and 3), Spacer-D (SEQ ID NO: 8), and sup tRNA (SEQ ID NO: 9) which 
was phosphoryiated at 5 -terminal end were dissolved in T4 DNA ligase buffer, denatured at 85°C for 2 minutes, 
and cooled on ice. After addition of T4 DNA ligase, it was allowed to react at 15°C for 2 hours, and subjected to 

25 phenol extraction, and ethanol precipitation. 

5) The product obtained in the above 4) was used as a template, and amplification was carried out by PCR using 
the primer Left+, and a primer 3 Pur- (SEQ ID NO: 10) under the conditions of 92°C/30 seconds tor denaturation. 
65°C/30 seconds for annealing, and 73°C/1 minute for elongation, which cycle was repeated 30 times. Th$ product 
was subjected to polyacrylamide denatured gel electrophoresis. Three regions A, B and C exhibiting different 

30 migration distances were excised, and DNA was extracted from the regions. 

6) The DNAs extracted from A, B and C in the above 5) and having different lengths were used as templates and 
amplification were again carried out by PCR under the same conditions, and lengths of the products were deter- 
mined by electrophoresis, and they were used as template DNA fa transcription. As a result, it was found that the 
numbers of Spacer-C inserted into each product of the fractions were 0-5 for the fraction c, 6-14 for the fraction b, 

35 and 1 5-1 8 for the fraction a. 

C. Preparation of RNA genome and ligation of dCpPur 

[0098] The regions A, B and C obtained in the above B was transcribed into RNA at 37° for 2 hours by using T7 
40 polymerase. Further, the dCpPur obtained in the above < 1 ) Preparation of 3'-terminal-end portion of the nucleic acid 
portion was phosphoryiated in the presence of ATP by using T4 polynucleotide kinase at 15°C for 24 hours, and ligated 
to the aforementioned transcribed RNA genome by using T4 RNA ligase at 4°C for 50 hours. From this procedure, an 
RNA genome comprising sup tRNA having puromycin at its 3'-terminal end could be constructed. 

45 (2) Preparation of genome for non-site-directed bonding 

A. Preparation of DNA and RNA tor mutated 4 repeats portion 

[0099] DNA of the mutated 4 repeats portion was prepared principally the same method as the aforementioned (1) 
so A. However, the termination codons were eliminated by changing the two termination codons, amber and ochre, to 
glutamine and lysine, respectively, and a new primer New/Right- (SEQ ID NO: 10) was synthesized in order to make 
the 3*-terminal-end sequence purine-rich, and used with Left* for PCR amplification. The amplification by PCR was per- 
formed under the conditions of 92°C/30 seconds for denaturation, 65°C/30 seconds for annealing, and 73°C/1 minute 
for elongation, which cycle was repeated 30 times. This DNA was used as a template to obtain an RNA genome through 
55 a reaction at 37°C for 2 hours utilizing T7 polymerase. 
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, £.UgationofSpacers1 to4 V 

[0100] After a reaction at 36°C for 1 hour with T4 polynucleotide kinase, of Spacer 1 which was a DIslA composed of 
21 nucleotides (SEQ ID NO: 1 1), Spacer 2 which was a DNA composed of 40 nucleotides (SEQ ID* NO: 12), Spacer 3 
5 which was a DNA composed of 60 nucleotides (SEQ ID NO; 1 3), or Spacer 4 which was a DNA composed of 80 nucle- 
otides (SEQ ID NO: 14), the RNA obtained in the above A was ligated to the spacers through a reaction at 10°C for 48 
hours using T4 RNA ligase. 

C. Ligation of peptide acceptor (P-Acceptor) 

[0101] A peptide acceptor (P-Acceptor, SEQ ID NO: 15), which was a chimeric nucleic acid composed of 21-nude- 
otide DNA and 4-nucleotide RNA, i.e., 25 nucleotides in total, was synthesized In order to enhance the Incorporation 
efficiency into ribosomes by Hgating it at its 3'-terminal end to dCpPur. To phosphoryiate the 5 -terminal end of P- Accep- 
tor, it was reacted at 36°C for 1 hour using T4 polynucleotide kinase. Then, the product was backed with Back3 ' (SEQ 
is ID NO: 16) having a complementary sequence thereto, and ligated to the 3-terminal end of each of the spacers pre- 
pared in the aforementioned 8 through a reaction at 16°C fa 2 hours using T4 RNA ligase. This P-Acceptor was, also 
directly ligated to the 3*-terminal end of the RNA obtained in the above A through a reaction at 10°C for 48 hours using 
T4 RNA ligase, and the product was designated as Non-Spacer genome. 

20 D. Ligation of dCpPur 

[0102] The dCpPur obtained in the above ( 1 Preparation of 3 '-terminal -end portion of nucleic acid portion was phos- 
phorylated by using T4 polynucleotide kinase at 1 5°C for 24 hours, and ligated to the 3*-terminal end of each of the 
genomes prepared in the above C by using T4 RNA ligase at 4°C for 50 hours. By this procedure, chimeric RNA 
25 genomes comprising puromycin at its 3'-terminaf end could be constructed. 

< 3 Optimization of nucleic acid portion 

A. She-directed method 1 

30 

[0103] Each of the RNA genomes prepared in the above (21(1), which were classified into each of the lengths cor- 
responding to the fractions a, b and c, was translated in 50 ul of E. coll cell-free translation system [E coll S30 Extract 
Systems for Linear Templates (Promega)] containing biotinylated lysyl tRNA (Promega), and after addition of 5 mg of 
streptavidin coated magnetic beads (Dynabeads, Dynal) to each reaction tube, it was incubated at room temperature 

35 fa 1 hour. Then, the Dynabeads were collected by a magnet, and the supernatant was aspirated. The remained Dyna- 
beads were washed 2 times with B&W buffer (1000 The beads were further washed twice with RT-PCR buffer (500 
ul), and resuspended in RT-PCR buffer (500 ul)- The suspension (50 was transferred into a 500-jil Eppendorf tube, 
and after the Dynabeads were immobilized with a magnet the supernatant was aspirated. To the remained Dynabeads, 
RT-PCR buffer, reverse transcriptase, and Taq polymerase [Access RT-PCR System (Promega)] were added. Reverse 

40 transcription was performed at 48°C for 1 hours, and PCR was performed unc&r the conditions of 94°C/30 seconds. 
65°C/40 seconds, and 68°C/1 minute and 40 seconds, which cycle was repeated 40 times, using primers of Right+ 
(SEQ ID NO: 4) and 3 Pur- (SEQ ID NO: 10). The results of the analysis of the fractions a, b and c by electrophoresis 
were shown is Figure 5. 

[0104] A band was detected from the group of the fraction c (Lane 3 in Figure 5). This band was separated from the 
45 gel by electrophoresis, ligated to "Left" having the T7 promoter and the Shine-Dalgarno region by PCR, and amplified 
by PCR using the primers Left+ (SEQ ID NO; 3) and 3 Pur- (SEQ ID NO: 10). This genome was designated as 
"Stranger". 

[01 05] Then, to examine whether the protein actually translated from Stranger was bonded to the mRNA portion (RNA 
genome portion), after transcription, it was ligated with pdCpPur at its 3 '-terminal end using T4 RNA ligase, dephospho- 
so rylated at 5 -terminal end of RNA using HK phosphatase (Epicentre) at 30°C for 1 hour, and labeled with fr- 32 P]-ATP 
using T4 polynucleotide kinase. The product was added to an E coll cell-free translation system as mRNA, and allowed 
to react at 37°C for 1 hour and 40 minutes. The results of 18% SDS-PAGE of the product are shown in Figure 6. From 
the results, it can be seen that the nucleic acid portion (genotype) and the protein portion (phenotype) were bonded to 
form in vitro viruses, i.e.. molecules that assign a genotype to a phenotype. at a rate of about 80% or more. 

55 

B. Non-site-directed method 

[0106] Because a short spacer was already used in the site-directed method, dCpPur, which had been phosphor- 
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ylated at 5 -terminal 'end In the presence of (r 32 P)-ATP using T4 polynucleotide Wnase ( ^6llgatedto the S'-terminal 
end of "Non-spacer RNA genome without a spacer through a reaction .using T4 RNA ligase at 4°C tor 50 hours; The 
product was added to, an £ coli ceil-free translation system together with mRNA encoding the ordinary 4 repeats, and 
allowed to react at 37*0 tor 1 hour and 30 minutes. This reaction mixture (10 nl) digested with ribonuclease T2. and an 
equal amount of the reaction mixture were electrophoresed by 18% SDS-PAGE, and analyzed by an image analyzer 
BAS2000 (Fujtfilm) (Figure 7). 

[0107] As a result, as tor the sample digested with ribonuclease T2, a band appeared at the same migration distance 
as that of the control the 4 repeats labeled with [^SJ-methionine, because the sample contained the released protein 
portion. On the other hand, as for the sample not treated, a band appeared above the 4 repeats protein, i.e., it was 
found that the band reflected a clearly larger molecular weight. This band did not correspond to the labeled mRNA Itself 
(about 400 nucleotides), because it migrated a longer distance than the tRNA. Therefore, jt was identified as a sub- 
stance corrposed of' bonded RNA arid protein. That is, these results demonstrated that the nucleic acid portion was 
bonded to the protein portion In a non-site-directed manner. < > • 



is Example 2: Preparation of in vitro virus (2) 



< 1 ) Preparation of 3'-terminal-end portion of nucleic acid portion 
(a) Synthesis of rCpPur (ribocytidyl(3 '-*5 /)puromycin Materials: 

[0108] Each material was purchased from the following manufacturers: Puromycin from Sigma. rC-p-amidite (AT-ben- 
zoyl-5 *-0-(4,4 '•dimethoxytrityl)-2 '-0-tert-butyldimethyisilyl)-cytidine-3 -0-(0-(2-cyanoethyl)-A/, AT-diisoprcpyl-phospho- 
ramidite]) from Japan PerSeptive, tetrazole from Japan Millipore, tetrabutylammonium fluoride from Akirich, QAE- 
Sephadex from Pharmacia, and silica gel tor chromatography from Merck. 

25 

Methods: 



[0109] Puromycin. (50 mg, 92 nmol) was dissolved in dry pyridine (2 ml), and dehydrated by evaporation under 
reduced pressure. This procedure was repeated three times. To this, 4% tetrazole solution in acetonitrile {15 ml) was 

30 added, and the mixture was stirred at room temperature. The reaction was monitored by silica gel thin layer, chroma- 
tography (TLC, developing solvent: chloroform:methanol - 9:1). The reaction was usually finished in a day. After the 
reaction, the solvent was removed under reduced pressure. To the residue, 0.1 M solution of iodine in tetrahydro- 
furan/pyridine/water (80:40:2, 3 ml) was added, and the formed phosphite triester was oxidized with stirring at room 
temperature. One and a half hours later, the solvent was removed under reduced pressure, and the residue was 

35 extracted with chloroform. The extract was dried over anhydrous magnesium sulfate, and the solvent was removed 
under reduced pressure. The residue was subjected to silica gel column chromatography, and eluted with chloro- 
form/methanol - 90:10. The ribocytidylpuromycin (CpPur) having protection groups was eluted by silica gel TLC (devel- 
oping solvent; chloroform:methanol - 9:1) at an Rf of 0.32. Then, protection groups were removed. The 
ribocytidylpuromycin having protecting groups was first treated with 80% aqueous solution of acetic acid (0.5 mi) at 

40 room temperature for 1 hour. After the acetic acid was removed under reduced pressure, to the residue, a mixed solu- 
tion of aqueous ammonia/ethanol - 2: 1 (0.5 ml) was added. After the mixture was left at room temperature for 1 5 hours, 
the solvent was removed under reduced pressure, and to the residue, 1 M solution of tetrabutylammonium fluoride in 
tetrahydrofuran (0.5 ml) was added to remove p-cyanoethyt group. Thirty minutes later, the solvent was removed under 
reduced pressure, and the residue was subjected to column chromatography using QAE-Sephadex, and eluted with 0 

45 to 0.5 M linear gradient of triethylamine carbonate. The eluent was collected and lyophilized to afford 10 mg of ribocyti- . 
dylpuromycin. The synthesized product was confirmed to be ribocytidylpuromycin by the facts that it afforded equimolar 
amounts of cytidine and puromycin-5 '-phosphate after digestion with nuclease P1, and that molecular ions of [M+H] + 
were appeared at m/z 777 in MALDI/TOF mass spectrometry. 



so (2) Nucleic acid portion (preparation of in vitro virus genome) 



Materials: 



[01 10] A ceil-free protein synthesis system of rabbit reticulocyte lysate (Nuclease treated Rabbit reticulocyte lysate) 
55 was purchased from Promega. T7 RNA polymerase, T4 DNA ligase, T4 RNA ligase, T4 polynucleotide kinase, human 
placenta ribonuclease inhibitor, EcoRI, BamHl, and deoxyribonucleotides were purchased from Takara Shuzo. Restric- 
tion enzymes SsrNI, and BglW were purchased from New England Labs. As for [ 35 S]-metNonine. and [ 32 P]-y ATP, those 
from Amersham, and as for Taq DNA polymerase, those from Kurabo and Grainer were used. As for the other biochem- 
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leal reagents, those from Sigma and Wbkb pure Chemicals were used. A plasmid containing the N-terminal half region 
of human tau protein (amino acid residue numbers 1-165) (pAR3040) was prepared by picking up the full length gene 
of human tau protein by PCR method from a cDNA library of human brain cloned In X ZAPII, Introducing the gene Into 
a plasmid, arrplifying only the N-terminal half region by PCR, and Introducing the amplified product Into a plasmid. As 
the PCR (polymerase chain reaction) apparatus, Model ASTEC PC800 (Astec) was used. 

(1) Preparation of genome 

A. Preparation of DNA for N-terminal half region , 

[01111 mRNAs encoding the N-termlnal half region of human tau protein (amino acid residues M65) with or without 
stop codon, which were ligated with spacer, peptide acceptor, and rCpPur at its' 3-terminal end, were constructed as 
follows (Figure 8). 

J 

1) A plasmid into which the N-terminal half region of human tau protein (Goedert, M. (1989) EMBO J. 8, 392-399) 
(pAR3040) was introduced, was linearized by digestion with a restriction enzyme Bgt\l 

2) The N-terminal half region (amino acid residue numbers 1-165) was amplified from the above genome by PCR. 
As primers, used were Leftl (SEQ ID NO: 18) for the 5' side, and Rightl (SEQ ID NO: 19) with stop codon, or 
Right2 (SEQ ID NO: 20) without stop codon for the 3' side. The PCR conditions consisted of 92°C/30 seconds for 
denaturation, 65°C/30 seconds for annealing, and 73°C/1 minute for elongation, and this cycle wa6 repeated 30 
times. 

3) A DNA sequence composed of the promoter region of T7 RNA polymerase, Kozak sequence, and DNA 
sequence corresponding to amino acid residue numbers 1-25 of human tau protein connected in this order (SEQ 
ID NO: 21) was prepared by chemical synthesis. 

4) The two kinds of purified DNA obtained in the procedures o1 the above 2) and 3) were connected by a two-step 
PCR as follows. That is, a mixture of the aforementioned two kinds of DNA was first amplified in the absence of 
primer, and subsequently amplified in the presence of primers of Left2 (SEQ ID NO: 22) and Rightl (SEQ ID NO: 

, 19) or Right2 (SEQ ID NO: 20). From the above procedure, a DNA containing the promoter of T7 RNA polymerase 
and Kozak sequence at an upstream side of ORF of the N-terminaf half region of human tau protein was prepared. 
An RNA was obtained through a reaction using this DNA as template, and T7 RNA polymerase at 37°C for two 
hours. 

B. Ligation of Spacer and peptide acceptor 

[01 1 2] Spacer 5 (SEQ ID NO: 23) and a peptide acceptor (P-Acceptor, SEQ ID NO: 1 5), which was a chimeric nucleic 
acid composed of 21 -nucleotide DNA and 4-nudeotide RNA, i.e., 25 nucleotides in total, was chemically synthesized. 
The 5 '-terminal end of the peptide acceptor was phosphorylated by reaction at 36°C for 1 hour using T4 polynucleotide 
kinase, and the peptide acceptor was backed with a splint DNA (SEQ ID NO: 24) having a sequence complementary 
thereto, and ligated to the 3'-terminal end of Spacer 5 through a reaction at 16°C for 2 hours using T4 DNA ligase. 

I 

C. Ligation of RNA and Spacer-peptide acceptor 

[01 1 3] The ligation product of Spacer 5-peptide acceptor obtained in the above B was phosphorylated at the 5 -ter- 
minal end through a reaction at 36°C for 1 hour using T4 polynucleotide kinase, and ligated to the RNA obtained in the 
45 above A through a reaction at 4°C for 48 hours using T4 RNA ligase. 

D. Ligation of rCpPur 

[0114] The rCpPur obtained in the above ( 1 Preparation of 3*-terminal end portion of nucleic acid portion was phos- 
so phorylated through a reaction at 1 5°C for 24 hours using T4 polynucleotide kinase, and ligated to the 3*-terminal end of 
the genome prepared in the above C through a reaction at 37°C for 30 minutes using T4 RNA ligase. From this proce- 
dure, a chimera RNA genome having puromycin at its ^-terminal end could be constructed. 

E. Bonding of rCpPur to C-terminal end of N-terminal half of human tau protein 

55 

[01 1 5] It is considered that, in obtaining effective binding of a C-terminal end of a protein and an RNA encoding it, the 
distance between puromycin and a stop codon, and presence or absence of a stop codon would become important fac- 
tors. Therefore, in order to examine effects of these factors, the following three kinds of genomes, mRNAs encoding the 
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N-terminal half of human tau protein each (1) having a stop codon but not having a DNA spacer, (2) having neither of a 
stop codon and a DNA spacer, and (3) not 'having a stop codon but having a DNA spacer, at the a-terrrinal end were 
prepared. By using these three Kinds of genomes, protein synthesis Was performed in the presence of rCpPur labeled 
with M P in a cell-free translation system utilizing rabbit reticulocyte tysate (Figure 9). it was found that when the 3 -ter- 

5 minal end did not have a DNA spacer. rCpPur was bound to the C-terminal of the protein with a similar efficiency regard- 
less of the presence or absence of a stop codon. That is, in SDS-PAGE (SDS- polyacrylamide gel electrophoresis), the 
bands of the proteins bonded to rCpPur (the first and the second lanes from the left in Figure 9) appeared at the same 
location as that of the protein monomer labeled with [ 3S S]-methionine (the most right lane in Figure 9). On the other 
hand, if mRNA had a DNA spacer, rCpPur was bonded to the C-terminai end of the protein at an efficiency about three 

ro times higher than those obtained in the former two kinds of mRNA even without a stop codon (the third lane from the 
left, Figure 9). This result can be considered to indicate that translation pausing of rfoosome occurred on the DNA 
sequence, and as a result, rCp^ur and the protein could be bonded efficiently. Further, this result suggests that, when 
a genome without a stop codon having a DNA spacer and rCpPur at'rts 3Merminal end is used as mRNA in a cell-free 
translation system, puromycin at the 3 -terminal end of mRNA can efficiently be bonded to the C-terminai end of the cor- 

i5 responding translated protein. 

< 3 )Construction of in vitro virus in cell-free translation system 

[01 1 6] The genome constructed in the above < 2 )Nucleic acid portion (preparation of in vitro virus genome) composed 

20 of /nRNA encoding the N-terminal half of human tau protein (1 -1 65), DNA spacer (105 mer), peptide acceptor and rCp- 
Pur was translated by. using rabbit reticulocyte lysate. When incorporation of [^SJ-methionine into the protein was 
examined first by using an RNA encoding the N-terminal half (1-165) of human tau protein as mRNA, bands appeared 
at locations corresponding to monomer (about 28 KDa) and dimer (about 55 KDa) of the N-terminal half (1-165). In this 
case, the monomer was the major product, and the dimer was observed in an extremely small amount (the first left lane 

25 in Figure 10(A)). This result indicates that the RNA encoding the N-terminal half of human tau protein (1-165) func- 
tioned as mRNA. When the genome composed of mRNA encoding the N-terminal half (1-165) of human tau protein. 
DNA spacer (105 mer), peptide acceptor and rCpPur was translated in a similar cell-free translation system containing 
[ 35 S]-methionine, and-the products were analyzed in time course (5 minutes. 10 minutes, 20 minutes, and 40 minutes), 
a new wide band appeared at a location slightly above that of the genome in addition to the bands at the location of the 

30 monomer and dimer (the first right lane in Figure 10(A)). The intensity of this band increased with increase of reaction 
time (the second to fifth lanes from the left in Figure 10(A)), and increase of the genome amount (Lanes 3 and 4 in Fig- 
ure 10(B)). These results indicate that the genome was bonded to the C-terminal end of the protein with a covalent bond 
through puromycin. This also means that a genotype was covalently bound to a phenotype. That is, a molecule that 
assigns a genotype to a phenotypes was formed. The present inventors designated this assigning molecule as in vitro 

35 virus. When the effect of the length of DNA spacer on the formation of in vitro virus was examined, it was found that the 
in vitro virus was not famed efficiently with a length of about 80 mer, and it required a length of at least 100 mer. 
10117] Further, the generation of in vitro virus was confirmed by using rCpPur labeled with 32 P. That is, a genome 
composed of mRNA encoding the N-terminal half (1-165) of human tau protein, DNA spacer (105 mer), peptide accep- 
tor, and [ 32 P]-rCpPur was translated by using rabbit reticulocyte lysate. The bonding of the genome and the protein was 

40 confirmed by digestion with mung bean nuclease. That is, when the translation product (Lane 3 in Figure 11) was 
digested with mung bean nuclease, bands appeared at the locations corresponding to monomer and dimer (Lane 1 in 
Figure 11) of the N-terminal half (1-165) of human tau protein (Lane 4 in Figure 1 1). This indicates that the rCpPur 
labeled with 32 P was attached to the 3'-terminal end of the protein. Also from this result it was confirmed that the 
genome was bound to the C-terminal end of the protein with a covalent bond through puromycin. The efficiency of the 

45 binding was estimated to be about 10%. Because in vitro virus genome having a concentration of 40 to 100 pmol/ml 
can be prepared, generated in vitro viruses would consist of a population containing 2.4 to 6 x 10 12 of mutants, and this 
number corresponds to 10000 times of that obtained in the phage display method (Scott, J. K. & Smith, Q. P. (1990) 
Science 249, 386-390). The genotype assignment to phenotype has advantages, for example, it eliminates the problem 
concerning the permeability, and it enables incorporation of various non-naturally-occurring amino acids, and therefore 

so it enables synthesis of an extremely large number of mutants, or creation of various functional proteins. 

Example 3: Protein evolution simulation method utilizing in vitro viruses 

[0118] The protein evolution simulation method utilizing in vitro viruses comprises, as shown in Figure 12, (1) con- 
55 struction of in vitro virus genomes, (2) completion of in vitro viruses. (3) selection process, (4) introduction of mutation, 
and (5) amplification, and it allows modification and creation of functional proteins. In particular, repetition of these steps 
allows efficient modification and creation of functional proteins. Among these steps, the steps of (1) and (2) were spe- 
cifically explained in Examples 1 and 2 mentioned above. Therefore, the steps (3), (4) and (5) will be explained in this 
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example 

[01191 First it was examined whether peptides specific to an antibody were selected. Specifically, mouse IgG was 
used as the antibody, and the known 22 region of protein A (Nilsson, B, ( et al.. (1987) Protein Eng.. 1, 107-113) was 
used as a peptide sequence to be specifically bound to the antibody. As a control, N-termlnal region (M05) of human 
tau protein (Goedert M. (1989) EMBO J. 8, 392*399) was used. According to the construction methods of in vitro 
viruses described in the above Examples 1 and 2, in vitro virus genomes encoding the 22 region of protein A and the 
N-terminal region (1-105) of human tau protein were prepared. With different ratios of the in vitro virus genome encod- 
ing the 22 region of protein A and the in vitro virus genome encoding the N-terminal region (1 -1 05) of human tau protein 
varying as 1:1. 1:10. 1:100, 1:1000 or the tike, they were translated in a cell-free translation 6ystem utflizing rabbit retic- 
ulocyte lysate at 30°C for 10 minutes. Then, the translation product was diluted, and centrlfuged to remove insoluble 
fractions, and the supernatant was added to a micropiate coated with mouse JgQ (blocked with bdvine serum albumin 
beforehand), and left stand at 4°C for 2 hours. Then, the translation product was removed from the micropiate, and it 
was washed with a washing buffer (50 mM TVis/acetic acid, pH 7.5, 150 mM NaCt, 10 mM EDTA, 0.1% TWeen 20) for 1 6 
times in total, and eluted 2 times with an elution buffer (1 M acetic acid, pH 2.8). The eluted solution was subjected to 
ethanol precipitation, and the precipitates were dissolved in sterile water (20 ^1), and used as a template tor reverse 
transcription PCR. The reverse transcription PCR was performed by using reverse transcriptase (Avian Miefoblastosis 
Virus Reverse Transcriptase, Promega), DNA polymerase (Tfl DNA Polymerase, Promega). and RT+ (SEQ ID NO: 25) 
and RT- (SEQ ID NO: 26) as primers/Following a reaction at 48°C for 40 minutes, the reverse transcriptase was inac- 
tivated with a treatment at 94°C tor 5 minutes, and a cycle of 94°C for 30 seconds, 66°C tor 40 seconds, and 72°C for 
40 seconds was repeated 30 times. The obtained PCR product was electrophoresed at 55°C on 4% polyaaylamide gel 
containing 8 M urea, and observed by silver staining. As a result, it was found that the in vitro virus genome containing 
the 2Z region of protein A could be amplified even in an amount of one-100th of the control genome, i.e., the in vitro 
virus genome containing the N-terminal region (1-105) of human tau protein. This result indicates that the in vitro virus 
genome containing the 22 region of protein A was specifically bound to mouse IgG through the translated 22 region of 
protein A. Therefore, H was confirmed that the in vitro viruses could be selected. Introduction of mutation and amplifica- 
tion can be performed by using the already-established error-prone PCR (Leung, D. W., et al.. (1989) J. Methods Cell 
Mol. Biol. 1, 11-15), Sexual PCR (Stemmer. W. P. C. (1994) Proc. Natl. Acad. Sci. If USA 91. 10747-10751) or the like. 
Therefore, it was verified that the protein evolution simulation method shown in Figure 12 was feasible. 



30 Industrial Applicability 

[0120] According to the present invention, a molecule assigning a genotype (nucleic acid portion) to a phenotype (pro- 
tein portion), and construction methods therefor are provided. There are also provided a protein evolution simulation 
method utilizing molecules that assign a genotype to a phenotype (in vitro viruses) constructed according to the present 

35 invention, which comprises selecting the in vitro viruses by the in vitro selection method, amplifying the gene portion of 
an extremely small amount of the selected in vitro viruses by reverse transcription PCR, and further performing ampli- 
fication while introducing a mutation, and the like. The molecule assigning the genotype to the phenotype, the protein 
evolution simulation method utilizing it and the like of the present invention are an extremely useful substance or exper- 
imental system for evolutionary molecular engineering, i.e., modification of functional biopolymers such as enzymes, 

40 antibodies, and ribozymes, and creation of biopolymers having functions which cannot be found in living organisms. 
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SEQUENCE, LISTING * 
(1 ) GENERAL INFORMATION: 

(i) APPLICANT: Mitsubishi Chemical Corporation' 
" (i i ) TITLE OF INVENTION: Molecule assigning genotype to phenotype and use 
thereof 

(iii) NUMBER OF SEQUENCES: 26 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADORESSEE: Mitsubishi Chemical Corporation 

(B) STREET:* 5-2. Marunouchi 2-chome 

(C) CITlh Chiyoda-ku 

(0) STATE: Tokyo ' ' 

(E) COUNTRY: JAPAN 

(F) ZIP: 103 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) (X>fUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-OOS/MS-OOS 
(0) SOFTWARE: Patentln 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 17.10.97 

(C) CLASSIFICATION: 

(D) . REFERENCE: E97128W623 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMJER: JP 1996274855 

(B) FILING DATE: 17.10.96 
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(2) INFORMATION FOR SEO ID NO:1: 
(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(D) OTHER INFORMATION: T7 promoter upstream 
(xi) SEQUENCE DESCRIPTION: SEO ID N0:1: 

GAGCATAGAT CTCGATCCCG CGAAATTAAT ACG 



34 
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(2) INFORMATION FOR SEO 10 N0:2: 
(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(0) TOPOLOGY: linear 
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(ii) MOLECULAR TYPE: other nucleic acid ' 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(0) OTHER INFORMATION: includes a termination codon 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GCAGCCGGAT CCTTACTACT TGTGGGTTTC AAT 



33 



10 



20 



25 
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(2) INFORMATION FOR SEQ ID N0:3: . 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(0) OTHER INFORMATION: includes an initiation codon; complementary 
to SEQ 10 NO: 4 

(xi) SEQUENCE DESCRIPTION: SEQ 10 MO: 3: 
GGACATGACA TTCATCATGT CTGGCATATG TAT 33 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(D) OTHER INFORMATION: includes an initiation codon; complementary 
to SEQ ID NO: 3 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 4: 



ATACATATGC CAGACATGAT GAATGTCATG TCC 



33 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(0) OTHER INFORMATION: has a portion complementary to SEQ 10 NO: 
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(xi) SEQUENCE DESCRIPTION: 'SEQ ID NO: 5: 
6ATCTATTTC TTATTC 



16 



(2) INFORMATION FOR SEQ 10 NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
' (0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid ' ■ 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(0) OTHER INFORMATION: has an initiation codon; complementary to 
SEQ 10 NO: 5 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:7: 
GAAGAGAATA AGAAATA . 17 

(2) INFORMATION FOR SEQ ID N0:7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(D) OTHER INFORMATION: has a portion complementary to SEO ID NO: 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCTTCTATTT CTTATTC . 
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(2) INFORMATION FOR SEQ ID NO :8: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 

(8) TYPE: nucleic acid ,' 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(D) OTHER INFORMATION: has a portion complementary to SEO ID NO: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GGGTAAACGA ATGAACAAGA ATAAGAAATA 



30 



(2) INFORMATION FOR SEO ID NO:9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: , ; . ...... 

(D) OTHER INFORMATION: has a sequence of an alanyl tRNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTGTTCATTC GTTTACCCGG GGCTATAGCT CAGCTGGGAG AGCGCCTGCT TCTAACGCAG 60 
GAGGTCTGCG GTTCGATCCC GCGTAGCTCC ACCAGGAGGC GACTAGCT 108 

(2) INFORMATION FOR SEQ 10 NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(6) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(D) OTHER INFORMATION: has a 3' -side sequence of an alanyl tRNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: , 
GTGGAGCTAC GCGGGATCGA ACC 23 



(2) INFORMATION FOR SEO ID NO:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(0) OTHER INFORMATION: has no initiation codon 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCAGCCGGAT CCTTTCTGCT TGTGG 25 

(2) INFORMATION FOR SEQ ID M0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 



55 



. (A) DESCRIPTION: synthetic ONA 
(ix), FEATURE: ' ' 

5 (D) OTHER INFORMATION: has a sequence partly complementary to SEQ 

ID NO: 17 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 12: 
CTTTAATGAC CTCCCCTCTC C 21 

10 (2) INFORMATION FOR SEQ ID NO: 13: . 

. (i) SEQUENCE CHARACTERISTICS: , ! 
(A) LENGTH: 40 
, (B) TYPE: nucleic acid 
t3 (C) STRANDEONESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DMA 
(ix) FEATURE: 

no (D) OTHER INFORMATION: has a sequence partly complementary to SEQ 

ID NO: 17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTTTAATAAT 1 1 T 1 1 1 1 1 1 1 TTTAATGACC TCCCCTCTCC 40 

25 (2) INFORMATION FOR SEQ ID N0:U: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 60 

(8) TYPE: nucleic acid 
30 (0 STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

35 (D) OTHER INFORMATION: has a sequence partly complementary to SEQ 

10 NO: 17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CTTTAATAAT l ll lllllll 1 1 1 1 1 1 ITTT TTTTTTTTTT TTTAATGACC TCCCCTCTCC 60 

40 i 

* (2) INFORMATION FOR SEQ 10 N0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 

(B) TYPE: nucleic acid 
45 (C) STRANDEONESS: single 

(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
so (ix) FEATURE: 

(D) OTHER INFORMATION: has a sequence partly complementary to SEO 

ID NO: 17 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NOtlS:,' 
CTTTAATAAT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
TTTAATGACC TCCCCTCTCC 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(C) LOCATION: 22.. 25 

(0) OTHER INFORMATION: RNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



60. 
80 



CTTACTGTCT TTTT 



17 TGAGC 



Z5 



25 
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(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(D) OTHER INFORMATION: has a sequence partly complementary to SEQ 

ID NO: 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAAAAAGACA GTAAGGGAGA GGGGAGGTCA TTA 33 



(2) INFORMATION FOR SEQ ID NO:18: 
< 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

(D) OTHER INFORMATION: includes an N-terminal initiation codon in 
so an N-terminal -half region of a tau protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGGCTGAGC CCCGCATGGA GTTC ' 24 




10 
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(2) INFORMATION FOR SEO ID NO: 19:.., • , . . 

(4). SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 
. . (ii ) MOLECULAR TYPE: other nucleic acid 

. (A) DESCRIPTION: synthetic ONA 
. (ix) FEATURE: 

(0) OTHER INFORMATION: includes a C-terminal termination codon in 
an N-terminal-half region of a tau protein 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 19: 
CTCTGCCACT TACTAGGGCT CCCG 24 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
(0) TOPOLOGY: linear 

2S (ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
(ix) FEATURE: 

(0) OTHER INFORMATION: includes a C-terminal termination codon in 
an N-terminal-half region of a tau protein 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CTCTGCCACC TTCTTGGGCT CCC 23 

(2) INFORMATION FOR SEQ ID NO:21: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 118 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

*o (ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DMA 
(ix) FEATURE: 

(D) OTHER INFORMATION: includes a promoter region of T7 RNA 
polymerase, a kozak sequence, and a DNA sequence corresponding to amino acid 
numbers 1-25 of a human tau protein 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 21: 
GATCCCGCGA AATTAATACG ACTCACTATA GGGAGACCAC AACGGTTTCC CTCTAGAAAT 60 
AATTTTGTTT AACTTTAAGA AGGAGATGCC ACCATGGTTG AGCCCCGCAT GGAGTTCG 118 
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(2) INFORMATION FOR SEQ ID NO:22: 
(i) SEOUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 ■• i' '• 

(B) TYPE: nucleic acid 

4 (0 STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

» (0) OTHER INFORMATION: a 5* -end region of 17, RNA polymerase 

I including a part of a promoter thereof 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GATCCCGCGA AATTAATACG ACTCACTATA 

(2) INFORMATION FOR SEQ ID N0:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 
(8) TYPE: nucleic acid 
x (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic DNA 
(ix) FEATURE: 

25 (D) OTHER INFORMATION: Spacer 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

AAGCCACTCG CGTGGTCTCG CATTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
TTTTTTTTTT TTTTTTTTTT TTTTTTTTAA TGACCTCCCC TCTCC 

30 

(2) INFORMATION FOR SEQ 10 NO:24: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: other nucleic acid 
(A) DESCRIPTION: synthetic DNA 
40 (ix) FEATURE: | 

(D) OTHER INFORMATION: splint DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AAAGACAGTA AGGGAGAGGG GAGGT 



(2} INFORMATION FOR SEQ 10 NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 
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(A) DESCRIPTION; synthetic ONA 
(ix) FEATURE: 

* (0) OTHER INFORMATION; a primer for reverse transcription PCR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAA6AAG6 ' A6ATATA 47 

« (2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

is (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic ONA 
20 (ix) FEATURE: 

(D) OTHER INFORMATION: a primer for reverse transcription PCR 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
AGCTTTCAGG CCAGCGTCCG TGTCA 25 

25 



Claims 

30 

1. A molecule assigning a genotype to a phenotype, which comprises a nucleic acid portion having a nucleotide 
sequence reflecting the genotype, and a protein portion comprising a protein involved in exhibition of the pheno- 
type. the nucleic acid portion and the protein portion being directly bound by a chemical bond. 

35 2. The assigning molecule according to daim 1 , wherein a 3'-terminal end of the nucleic acid portion and a C-terminal 
end of the protein portion are bonded with a covalent bond. 

3. The assigning molecule according to claim 1 or 2, wherein a 3*-terminal end of the nucleic acid portion covalentfy 
bonded to a C-terminal end of the protein portion is puromydn. 

40 i 

4. The assigning molecule according to any one of claims 1 to 3, wherein the nucleic add portion comprises a gene 

encoding a protein, and the protein portion is a translation product of the gene of the nudeic acid portion. 

5. The assigning molecule according to claim 4, wherein the nudeic add portion comprises a gene composed of 
45 RNA, and a suppressor tRNA bonded to the gene through a spacer. 

6. The assigning molecule according to claim 5, wherein the suppressor tRNA comprises an anticodon corresponding 
to a termination codon of the gene. 

so 7. The assigning molecule according to claim 4, wherein the nucleic acid portion comprises a gene composed of 
RNA, and a spacer composed of DNA and RNA. 

8. The assigning molecule according to claim 4, wherein the nucleic add portion comprises a gene composed of 
RNA, and a spacer composed of DNA and polyethylene glycol. 

55 

9. The assigning molecule according to claim 4, wherein the nudeic add portion comprises a gene composed of 
RNA, and a spacer composed of a double-stranded DNA. 
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10. The assigning molecule according to claim 4, wherein the nucleic add portion comprise? a gene composed of 
' RNA, and a spacer composed of a double strand of RNA and a fhort cliaiq peptide nucleic acid I (PNA) or DNA. 

11. The asking molecule according to claim ,4, wherein the nucleic acid portion comprises a gene composed of 
DNA, and a spacer composed of DNA and RNA. 

12. A method for constructing the assigning molecule as defined in claim 5, which comprises (a) bonding a DNA com- 
prising a sequence corresponding to a suppressor tRNA, to a 3'-terminat end of a DNA containing a gene through 
a spacer, (b) transcribing the obtained DNA bonded product into RNA, (c) bonding, to a 3Merminal end of the 
obtained RNA, a nucleoside of a substance having a chemical.structure analogous tofoat of a nucleoside, which 
can be covaleritly bound to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, and (d) performing protein synthesis in a cell-free protein synthesis system using the obtained bonded prod- 
uct as mRNA to bond a nucleic acid portion containing the gene to a translation product of the gene. 

13. A method for constructing the assigning molecule as defined in claim 7, which comprises (a) preparing a DNA con- 
taining a gene which has no termination codon, (b) transcribing the prepared DNA into RNA, (c) bonding a chimeric 
spacer composed of DNA and RNA to a 3Merminal end of the obtained RNA, (d) bonding, to a 3*-terminal end of 
the obtained bonded product, a nucleoside or a substance having a chemical structure analogous to that of a nucl- 
eoside, which can be covalently bound to an amino acid or a substance having a chemical structure analogous to 
that of an amino acid, and (e) performing protein synthesis in a cell-free protein synthesis system using the 
obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a translation product of 
the gene. 

1 4. The construction method according to claim 1 2 or1 3. wherein the nucleoside or the substance having the chemical 
structure analogous to that of the nucleoside is puromycin. 

15. A method for constructing the assigning molecule as defined in claim 8, which comprises (a) preparing a DNA con- 
* " taining a gene which has no termination codon, (b) transcribing the. prepared DNA into RNA. (c) bonding a chimeric 

spacer composed of DNA and polyethylene glycol to a 3' -terminal end of the obtained RNA, (d) bonding, to a 3'- 
terminal end of the obtained bonded product, a nucleoside or a substance having a chemical structure analogous 
to that of a nucleoside, which can be covalently bound to an amino acid or a substance having a chemical structure 
analogous to that of an amino acid, and (e) performing protein synthesis in a cell-free protein synthesis system 
using the obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a translation 
product of the gene. 

1 6. A method for constructing the assigning molecule as defined in claim 9. which comprises (a) preparing a DNA con- 
" taining a gene which has no termination codon, (b) transcribing the prepared DNA into RNA, (c) bonding a spacer 

composed of double-stranded DNA to a 3*-terminal end of the obtained RNA. (d) bonding, to a 3'-terminal end of 
the obtained bonded product, a nucleoside or a substance having a chemical structure analogous to that of a nucl- 
eoside, which can be covalently bound to an amino acid or a substance having a chemical structure analogous to 
that of an amino acid, and (e) performing protein synthesis in a cell-free protein synthesis system using the 
obtained bonded product as mRNA to bond a nucleic acid portion containing the gene to a translation product of 
the gene. 

17. A method for constructing the assigning molecule as defined in claim 10, which comprises (a) preparing a DNA 
containing a gene which has no termination codon, and a nucleotide sequence of a spacer, (b) transcribing the pre- 
pared DNA into RNA, (c) bonding, to a 3'-terminal end of the obtained RNA, a nucleoside or a substance having a 
chemical structure analogous to that of a nucleoside, which can be covalently bound to an amino acid or a sub- 
stance having a chemical structure analogous to that of an amino acid, (d) adding a short chain PNA or DNA to a 
3'-terminal end side portion of the gene in the obtained RNA bonded product to form a double-stranded chain, and 
(e) performing protein synthesis in a cell-free protein synthesis system using the obtained bonded product as 
mRNA to bond a nucleic acid portion containing the gene to a translation product of the gene. 

18. A method for protein evolution simulation, which comprises a construction step for constructing assigning mole- 
cules from a DNA containing a gene by the construction method as defined in any one of claims 12. 13. 15. 16 and 
17, a selection step for selecting the assigning molecules obtained in the construction step, a mutation introduction 
step for introducing a mutation into a gene portion of an assigning molecule selected in the selection step, and an 
amplification step for amplifying the gene portion obtained in the mutation introduction step. 
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Id. The method for protein evolution si^ 18, wherein the construction step; the selection step, 

the mutation Introduction step and the anptificatioh step are repeatedly performed by providing the DNA obtained 
in the amplification step to the construction step. ' - * , v , > 

20. A method for assaying protein/protein or protein/nucleic acid intermolecular action, which comprises a construction 
step for constructing assigning molecules by the construction method of any one as defined in claims 12, .13, 15, 
16 and 1 7, and an assay. step for examining intermolecular action of the assigning molecules obtained in the con- 
struction step with another protein or nucleic acid. 

21. An apparatus for performing the method for evolution simulation as defined in claim 18 or 19. which comprises a 
means for constructing assigning molecules, said means comprising a first bonding means for bonding a DNA 
comprising a sequence corresponding to a suppressor tRNA to a ^-terminal end of a DNA containing a gene 
through a spacer, a transcription means for transcribing the DIvJA bonded product obtained by the first bonding 
means into RNA, a second bonding means for bonding, to a 3' -terminal end of the RNA obtained by the transcrip- 
tion means, a nucleoside or a substance having a chemical structure analogous to that of a nucleoside, which can 
be covalently bound to an amino acid or a substance having a chemical structure analogous to that of an amino 
acid, and a third bonding means for performing protein synthesis in a cell-free protein synthesis system using the 
bonded product obtained by the second bonding means as mRNA to bond a nucleic acid portion containing the 
gene to a translation product of the gene, or a means for constructing assigning molecules, said means comprising 
a transcription means for transcribing a DNA containing a gene into RNA, a first bonding means for bonding a chi- 
meric spacer composed of DNA and RNA, a chimeric spacer composed of DNA and polyethylene glycol, a double- 
stranded spacer composed of DNA, or a double-stranded spacer composed of RNA and PNA or DNA to a 3*-ter- 
minal end of the RNA obtained by the transcription means, a second bonding means for bonding, to a 3'-terminal 
end of the RNA-spacer bonded product obtained by the first bonding means, a nucleoside or a substance having 
a chemical structure analogous to that of a nucleoside, which can be covalently bound to an amino acid or a sub- 
stance having a chemical structure analogous to that of an amino acid, and a third bonding means for performing 
protein synthesis in a cell-free protein synthesis system using the bonded product obtained by the second bonding 
means as mRNA- to bond a nucleic acid portion containing the gene to a translation product of the gene; a selection 
means for selecting the constructed assigning molecules; a mutation introduction means for introducing a mutation 
into a gene portion of an assigning molecule selected; and an amplification means for amplifying the gene portion 
to which the mutation is introduced. 
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